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Monitoring Control for Multi-Domain Processors 

The present invention relates to the field of controlling the monitoring of a 
processor that operates in different domains and in particular, in certain preferred 
5 embodiments, to the control of the debugging and tracing functions of such processors. 

In order to be able to monitor processors to find any faults that may be present 
debug and trace applications have been developed. In order for these applications to 
be able to function properly, these applications have conventionally had access to the 
10 entire processor. This is generally needed as it allows the monitoring functions to 
locate a fault that may be anywhere on the processing system. 

With processors that operate in more than one domain, for example, processors 
that operate in a secure domain and a non-secure domain, allowing a monitoring 

1 5 application access to the entire processor enables it to find a fault located anywhere on 
the system. However, such a comprehensive monitoring function may also allow data 
movement or leakage between the two domains. In the case of a secure and a non- 
secure domain this could potentially provide access to information that should be 
secure and could therefore introduce a weakness into the security of such a. system. 

20 However, not allowing debug or trace to access certain parts of a system, makes it 
difficult, if not impossible, to locate certain faults that may dccur in a non-accessible 
part of the system. l " 

Accordingly, it is desirable to provide an improved way of ..controlling 
25 monitoring functions to enable a processor to be monitored while reducing the risk of 
data leakage between domains. 

According to a first aspect of the present invention there is provided method of 
controlling a monitoring function of a processor, the processor being operable in at 
30 least two domains, comprising a first domain and a second domain, the first and 
second domains each comprising at least one mode, the method comprising the steps 
of: setting at least one control value, the at least one control value relating to a 



condition and being indicative of whether the monitoring function is allowable in the 
first domain; and only allowing initiation of the monitoring function in the first domain 
when the condition is present if its related control value indicates that the monitoring 
function is allowable. 

The invention recognises that monitoring functions on a processor that is 
operable in more than one domain are a potential source of data leakage between 
domains. However, not allowing monitoring functions access to the entire system may 
seriously limit the fault-finding abilities of such monitoring functions. By providing a 
method in which initiation of a monitoring function can be controlled with respect to 
certain conditions, the degree of monitoring allowed can be set according to these 
conditions. Thus, if conditions prevail that provide a greater risk of data leakage from 
the first domain initiation of monitoring functions within this domain are disabled. 
Under different conditions where security risks are judged to be low monitoring can be 
allowed. 

In preferred embodiments the first domain is a secure domain and the second 
domain is a non-secure domain. 

Embodiments of the present invention are particularly appropriate for use in 
systems having a secure and non-secure domain. Monitoring functions generally 
access the whole system and as such can introduce weaknesses into the security of the 
system. Embodiments of the present invention can control initiation of monitoring 
functions and can for example only allow all of the data to be accessed where a 
monitoring function is initiated from within the secure domain, and only allow 
monitoring of the non-secure domain where monitoring functions are initiated from the 
non-secure domain. 

Although the conditions that the at least one control value relates to can be a 
variety of things, in preferred embodiments they comprise domain, mode or type of 
monitoring system. 
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It is useful to be able to specify that a monitoring function is only initiated in a 
particular domain, for example, only in the non-secure domain. Additionally, 
specifying by mode, such as user mode or supervisor mode or specifying by type of 
monitoring function such as debug or trace allows further control of the monitoring 
5 function and gives an extra degree of control so that the need for monitoring can be 
balanced against the security risk in a variety of ways. 

In one embodiment the condition comprises a secure domain and the control 
value comprises a secure domain enable value, initiation of monitoring in the secure 
10 domain only being allowed if the secure domain enable value is set. Initiation of a 
monitoring function is always generally allowed in the non-secure domain. 

In other embodiments, the secure domain may include a secure user mode and 
the condition comprises a secure user mode, the control value comprising a secure user 
15 mode enable bit so that initiation of monitoring is only allowed in the secure user 
mode if the secure user bit has been set. 

Embodiments of the present invention are applicable to several different types 
of monitoring functions, including debug monitoring and trace monitoring. In some 
20 embodiments, the condition comprises a type of monitoring function and if the control 
value comprises a debug enable bit, initiation of debug in the first domain is only 
allowable if the debug enable bit has been set. 

In alternative embodiments, the condition comprises a trace monitoring 
25 function and the control value comprises a trace enable bit, initiation of trace in the 
first domain only being allowable if the control trace enable bit has been set. 

Preferably, the secure domain enable value comprises a secure debug enable bit 
and a secure trace enable bit, initiation of debug and trace in the secure domain only 
30 being allowable if the respective portions of the secure domain enable value are set. 
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Preferably, the method comprises setting a plurality of control values, each of 
the plurality of control values relating to a different condition; and only allowing 
initiation of the monitoring function in the first domain if any of the conditions are 
present if each of the control values related to a condition that is present indicates that 
5 the monitoring function is allowable. 
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25 



Embodiments of the present invention allow a range of conditions to be 
specified such that when any of the conditions are present monitoring is only initiated 
if all the control values related to these conditions are present. This provides a further 
degree of control over the granularity of the monitoring function and enables data 
leakage risks to be balanced against need for monitoring in a more sophisticated way. 



In preferred embodiments, the method further comprises the step of; setting a 
control indicator, the control indicator indicating that monitoring is only allowable for 
15 specified applications; and prior to initialising monitoring function checking an 
application identifier; and only allowing initiation of the monitoring function if the 
application currently running is one for which monitoring is allowable. 

Thus, in preferred embodiments, initiation of monitoring functions can also be 
20 controlled depending on the type of application running. This, further degree of 
control is provided so that specific applications can be monitored and others not. In 
many circumstances a particular application may be run by one user and another by 
another. Clearly in some circumstances it is advantageous to be able to inhibit data 
leakage between applications run by different users. 



Advantageously, the step of setting a control indicator comprises setting a 
control indicator stored in a predetermined position in a storage element. 

Preferably, the monitoring function comprises monitoring the processer and 
30 capturing diagnostic data, the method comprises the further step of following initiation 
of the monitoring function only allowing the capturing of diagnostic data in the first 
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domain while an application running on the processor is one for which monitoring is 
allowable. 

In this embodiment, not only is initiation of the monitoring function only 
5 allowed when a particular application is running, but the capture of diagnostic data by 
the monitoring function is also only allowed to continue while that application, or 
another that is allowable, continues to be running. This provides improved granularity 
- between applications allowing certain applications to be monitored and others not 

10 There are a variety of ways of setting the control value, in preferred 

embodiments the setting of at least one control is performed either by setting the 
control value via an input port or by setting the control value from the first domain. 

Thus, the control value that controls access of the monitoring function to the 
15 first domain can only be set via an input port or from within the first domain. Thus, 
generally access to the control value from the second domain is not allowed. 

Preferably, the method comprises a further step of blocking write access to the 
control value by the input port such that the step of setting the control value can 
20 henceforth only be performed by setting the control value from the first domain. 

This provides increased security to the control value and thus the processor 
after the further step has been performed. Thus, following this further step, the control 
value can only be accessed from within the first domain and thus monitoring of the 
25 first domain is controlled from within the first domain. Therefore anyone not having 
access to the first domain is inhibited from gaining access to it via a monitoring 
function. 

In some embodiments the first domain comprises a first user mode and a first 
30 privileged mode, in one embodiment the step of setting at least one control value in the 
first domain comprises setting the control value from the first privileged mode, in 
another alternative embodiment the step of setting the at least one control value in the 



first domain comprises inputting an authentication code from a mode that is not a first 
privileged mode and then setting the control value. 

Both of the above ways of setting the control value comprise either the need to 
gain access to a first privileged mode or the need to know an authentication code. 
Generally, input of an authentication code provides access to a privileged mode. 

A second aspect of the present invention comprises a processor 
operable in a first domain and a second domain the first and second domains each 
comprising at least one mode, the processor comprising: monitoring logic; a storage 
element operable to be set to contain at least one control value, the at least one control 
value relating to a condition and being indicative of whether operation of the 
monitoring logic is allowable in the first domain; and control logic operable to control 
initiation of the monitoring logic and only to allow initiation of the monitoring logic in 
the first domain when the condition is present if its related control value indicates that 
operation of the monitoring logic is allowable. 

The present invention will be described further, by way of example only, with 
reference to preferred embodiments thereof as illustrated in the accompanying 
drawings, in which: 

Figure 1 is a block diagram schematically illustrating a data processing 
apparatus in accordance with preferred embodiments of the present invention; 

Figure 2 schematically illustrates different programs operating in a non-secure 
domain and a secure domain; 

Figure 3 schematically illustrates a matrix of processing modes associated with 
different security domains; 

Figures 4 and 5 schematically illustrate different relationships between 
processing modes and security domains; 
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Figure 6 illustrates one programmer's model of a register bank of a processor 
depending upon the processing mode; 

5 Figure 7 illustrates an example of providing separate register banks for a secure 

domain and a non-secure domain; 

Figure 8 schematically illustrates a plurality of processing modes with switches 
between security domains being made via a separate monitor mode; 

10 

Figure 9 schematically illustrates a scenario for security domain switching 
using a mode switching software interrupt instruction; 

Figure 10 schematically illustrates one example of how non-secure interrupt 
15 requests and secure interrupt requests may be processed by the system; 

Figures 11A and 1 IB schematically illustrate an example of non-secure 
interrupt request processing and an example of secure interrupt request processing in 
accordance with Figure 10; 

20 

Figure 12 illustrates an alternative scheme for the handling of non-secure 
interrupt request signals and secure interrupt request signals compared to that 
illustrated in Figure 10; 

25 Figures 13A and 13B illustrate example scenarios for dealing with a non-secure 

interrupt request and a secure interrupt request in accordance with the scheme 
illustrated in Figure 12; 

Figure 14 is an example of a vector interrupt table; 

30 

Figure 15 schematically illustrates multiple vector interrupt tables associated 
with different security domains; 
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Figure 16 schematically illustrates an exception control register; 

Figure 17 is a flow diagram illustrating how an instruction attempting to 
5 change a processing status register in a manner that alters the security domain setting 
can generate a separate mode change exception which in turn triggers entry into the 
monitor mode and running of the monitor program; 

Figures 18 to 20 illustrate a view of different processing modes and scenario 
10 for switching between secure and non-secure domains in accordance with another 
example embodiment(s); 

Figure 21 is a diagram illustrating the memory management logic used in one 
embodiment of the present invention to control access to memory; 

15 

Figure 22 is a block diagram illustrating the memory management logic of a 
second embodiment of the present invention used to control access to memory; 

Figure 23 is a flow diagram illustrating the process performed in one 
20 embodiment of the present invention within the memory management logic to process 
a memory access request that specifies a virtual address; 

Figure 24 is a flow diagram illustrating the process performed in one 
embodiment of the present invention within the memory management logic to process 
25 a memory access request that specifies a physical address; 

Figure 25 schematically illustrates how the partition checker of preferred 
embodiments is operable to prevent access to a physical address within secure memory 
when the device issuing the memory access request is operating in a non-secure mode; 

30 

Figure 26 is a diagram illustrating the use of both a non-secure page table and a 
secure page table in preferred embodiments of the present invention; 
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Figure 27 is a diagram illustrating two forms of flag used within the main 
translation lookaside buffer (TLB) of preferred embodiments; 

5 Figure 28 illustrates how memory may be partitioned after a boot stage in one 

embodiment of the present invention; 

Figure 29 illustrates the mapping of the non-secure memory by the memory 
management unit following the performance of the boot partition in accordance with 
1 0 an embodiment of the present invention; 

Figure 30 illustrates how the rights of a part of memory can be altered to allow 
a secure application to share memory with a non-secure application in accordance with 
an embodiment of the present invention; 

15 

Figure 31 illustrates how devices may be connected to the external bus of the 
data processing apparatus in accordance with one embodiment of the present 
invention; 



Figure 32 is a block diagram illustrating how devices may be coupled to the 
external bus in accordance with the second embodiment of the present invention; 

Figure 33 schematically shows possible granularity of monitoring functions for 
different modes and applications running on a processor; 

Figure 34 shows possible ways of initiating different monitoring functions; 

Figure 35 shows a table of control values for controlling availability of 
different monitoring functions; 

Figure 36 shows a positive-edge triggered FLIP-FLOP view; 
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Figure 37 a scan chain cell; 

Figure 38 shows a plurality of scan chain cells in a scan chain; 
Figure 39 shows a debug TAP controller; 
Figure 40A shows a debug TAP controller with a JADI input; 
Figure 40B shows a scan chain cell with a bypass register; 

Figure 41 schematically illustrates a processor comprising a core, scan chains 
' and a Debug Status and Control Register; 

Figure 42 schematically illustrates the factors controlling debug or trace 
initialisation; 

Figures 43 A and 43B show a summary of debug granularity; 

Figure 44 schematically illustrates the granularity of debug while it is running; 



Figure 45A and 45B show monitor debug when debug is enabled in secure 
world and when it is not enabled respectively. 

Figure 1 is a block diagram illustrating a data processing apparatus in 
accordance with preferred embodiments of the present invention. The data processing 
apparatus incorporates a processor core 10 within which is provided an arithmetic 
logic unit (ALU) 16 arranged to execute sequences of instructions. Data required by 
the ALU 16 is stored within a register bank 14. The core 10 is provided with various 
monitoring functions to enable diagnostic data to be captured indicative of the 
activities of the processor core. As an example, an Embedded Trace Module (ETM) 
22 is provided for producing a real time trace of certain activities of the processor core 
in dependence on the contents of certain control registers 26 within the ETM 22 
defining which activities are to be traced. The trace signals are typically output to a 
trace buffer from where they can subsequently be analysed. A vectored interrupt 
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controller 21 is provided for managing the servicing of a plurality of interrupts which 
may be raised by various peripherals (). 

Further, as shown in Figure 1, another monitoring functionality that can be 
5 provided within the core 10 is a debug function, a debugging application external to 
the data processing apparatus being able to communicate with the core 10 via a Joint 
Test Access Group (JTAG) controller 18 which is coupled to one or more scan chains 
12. Information about the status of various parts of the processor core 10 can be 
output via the scan chains 12 and the JTAG controller 18 to the external debugging 
10 application. An In Circuit Emulator (ICE) 20 is used to store within registers 24 
conditions identifying when the debug functions should be started and stopped, and 
hence for example will be used to store breakpoints, watchpoints, etc. 

The core 10 is coupled to a system bus 40 via memory management logic 30 
15 which is arranged to manage memory access requests issued by the core 10 for access 
to locations in memory of the data processing apparatus. Certain parts of the memory 
may be embodied by memory units connected directly to the system bus 40, for 
example the Tightly Coupled Memory (TCM) 36, and the cache 38 illustrated in 
Figure 1. Additional devices may also be provided for accessing such memory,, for 
20 example a Direct Memory Access (DMA) controller 32. Typically, various control 
registers 34 will be provided for defining certain control parameters of the various 
elements of the chip, these control registers also being referred to herein as 
coprocessor 15 (CP 15) registers. 

25 The chip containing the core 10 will typically be coupled to an external bus 70 

(for example a bus operating in accordance with the "Advanced Microcontroller Bus 
Architecture" (AMBA) specification developed by ARM Limited) via an external bus 
interface 42, and various devices may be connected to the external bus 70. These 
devices may include master devices such as a digital signal processor (DSP) 50, or a 

30 direct memory access (DMA) controller 52, as well as various slave devices such as 
the boot ROM 44, the screen driver 46, the external memory 56, an input/output (I/O) 
interface 60 or a key storage unit 64. These various slave devices illustrated in Figure 
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1 can all be considered as incorporating parts of the overall memory of the data 
processing apparatus. For example, the boot ROM 44 will form part of the 
addressable memory of the data processing apparatus, as will the external memory 56. 
Further, devices such as the screen driver 46, I/O interface 60 and key storage unit 64 
will all include internal storage elements such as registers or buffers 48, 62, 66, 
respectively, which are all independently addressable as part of the overall memory of 
the data processing apparatus. As will be discussed in more detail later, a part of the 
memory, e.g. part of external memory 56, will be used to store one or more page tables 
- 58 defining information relevant to control of memory accesses. 



As will be appreciated by those skilled in the art, the external bus 70 will 
typically be provided with arbiter and decoder logic 54, the arbiter being used to 
arbitrate between multiple memory access requests issued by multiple master devices, 
for example the core 10, the DMA 32, the DSP 50, the DMA 52, etc, whilst the 
15 decoder will be used to determine which slave device on the external bus should 
handle any particular memory access request. 

Figure 2 schematically illustrates various programs running on a processing 
system having a secure domain and a non-secure domain. The system is provided with 

20 a monitor program 72 which executes at least partially in a monitor mode. A security 
status flag is accessible only within the monitor mode and may be written by the 
monitor program 72. The monitor program 72 is responsible for managing all changes 
between the secure domain and the non-secure domain in either direction. From a 
view external to the core the monitor mode is always secure and the monitor program 

25 is in secure memory. 

Within the non-secure domain there is provided a non-secure operating system 
74 and a plurality of non-secure application programs 76, 78 which execute in co- 
operation with the non-secure operating system 74. In the secure domain, a secure 
30 kernel program 80 is provided. The secure kernel program 80 can be considered to 
form a secure operating system. Typically such a secure kernel program 80 will be 
designed to provide only those functions which are essential to processing activities 
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which must be provided in the secure domain such that the secure kernel 80 can be as 
small and simple as possible since this will tend to make it more secure. A plurality of 
secure applications 82, 84 are illustrated as executing in combination with the secure 
kernel 80. 

5 

Figure 3 illustrates a matrix of processing modes associated with different 
security domains. In this particular example the processing modes are symmetrical 
' with respect to the security domain and accordingly Mode 1 and Mode 2 exist in both 
; secure and non-secure forms. 

10 

The monitor mode has the highest level of security access in the system and is 
the only mode entitled to switch the system between the non-secure domain and the 
secure domain in either direction. Thus, all domain switches take place via a switch to 
the monitor mode and the execution of the monitor program 72 within the monitor 
15 mode. 

Figure 4 schematically illustrates another set of non-secure domain processing 
modes 1, 2, 3, 4 and secure domain processing modes a, b, c. In contrast to the 
symmetric arrangement of Figure 3, Figure 4 shows that some of the processing modes 

20 may not be present in one or other of the security domains. The monitor mode 86 is 
again illustrated as straddling the non-secure domain and the secure domain. The 
monitor mode 86 can be considered a secure processing mode, since the secure status 
flag may be changed in this mode and monitor program 72 in the monitor mode has 
the ability to itself set the security status flag it effectively provides the ultimate level 

25 of security within the system as a whole. 

Figure 5 schematically illustrates another arrangement of processing modes 
with respect to security domains. In this arrangement both secure and non-secure 
domains are identified as well as a further domain. This further domain may be such 
30 that it is isolated from other parts of a system in a way that it does not need to interact 
with either of the secure domain or non-secure domain illustrated and as such the issue 
of to which of these it belongs to is not relevant. 
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As will be appreciated a processing system, such as a microprocessor is 
normally provided with a register bank 88 in which operand values may be stored. 
Figure 6 illustrates a programmer's model view of an example register bank with 
5 dedicated registers being provided for certain of the register numbers in certain of the 
processing modes. More particularly, the example of Figure 6 is an extension of the 
known ARM register bank (e.g. as provided in ARM7 processors of ARM Limited, 
Cambridge, England) which is provided with a dedicated saved program status 
register, a dedicated stack pointer register and a dedicated link register R14 for each 

10 processing mode, but in this case extended by the provision of a monitor mode. As 
illustrated in Figure 6, the fast interrupt mode has additional dedicated registers 
provided such that upon entry of the fast interrupt mode there is no need to save and 
then restore register contents from other modes. The monitor mode may in alternative 
embodiments also be provided with dedicated further registers in a similar manner to 

1 5 the fast interrupt mode so as to speed up processing of a security domain switch and 
reduce system latency associated with such switches. 

Figure 7 schematically illustrates another embodiment in which the register 
bank 88 is provided in the form of two complete and separate register banks that are 

20 respectively used in the secure domain and the non-secure domain. This is one way in 
which secure data stored within registers operable in the secure domain can be 
prevented from becoming accessible when a switch is made to the non-secure domain. 
However, this arrangement hinders the possibility of passing data from the non-secure 
domain to the secure domain as may be permitted and desirable by using the fast and 

25 efficient mechanism of placing it in a register which is accessible in both the non- 
secure domain and the secure domain. 

An important advantage of having secure register bank is to avoid the need for 
flushing the contents of registers before switching from one world to the other. If 
30 latency is not a critical issue, a simpler hardware system with no duplicated registers 
for the secure domain world may be used, e.g. Figure 6. The monitor mode is 
responsible switching from one domain to the other. Restoring context, saving 
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previous context, as well as flushing registers is performed by a monitor program at 
least partially executing in monitor mode. The system behaves thus like a 
virtualisation model. This type of embodiment is discussed further below. Reference 
should be made to, for example, the programmer's model of the ARM7 upon which 
the security features described herein build. 

Processor Modes 

Instead of duplicating modes in secure world, the same modes support both 
secure and non-secure domains (see Figure 8). Monitor mode is aware of the current 
status of the core, either secure or non-secure (e.g. as read from an S bit stored is a 
coprocessor configuration register). 

In the Figure 8 5 whenever an SMI (Software Monitor Interrupt instruction) 
occurs, the core enters monitor mode to switch properly from one world to the other. 

With reference to Figure 9: 

1 . The scheduler launches thread 1 

2. Thread 1 needs to perform a secure function => SMI secure call, the core enters 
monitor mode. Under hardware control the current PC and CPSR (current processor 
status register) are stored in R14 mon and SPSR_mon (saved processor status register 
for the monitor mode) and IRQ/FIQ interrupts are disabled. 

3. The monitor program does the following tasks: 
The S bit is set (the secure status flag). 

Saves at least R14_mon and SPSR_mon in a stack so that non-secure context 
cannot be lost if an exception occurs whilst the secure application is running. 
Checks there is a new thread to launch: secure thread 1. A mechanism (via 
thread ID table) indicates that thread 1 is active in the secure world. 
IRQ/FIQ interrupts are re-enabled. A secure application can then start in 
secure user mode. 
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4. Secure thread 1 runs until it finishes, then branches (SMI) onto the 'return from 
secure' function of the monitor program mode (IRQ/FIQ interrupts are then disabled 
when the core enters monitor mode) 

5. The 'return from secure' function does the following tasks: 

indicates that secure thread 1 is finished (e.g., in the case of a thread ID table, 
remove thread 1 from the table). 

Restore from stack non-secure context and flush required registers, so that no 
secure data can be read once return has been made to the non-secure domain. 
Then branches back to the non-secure domain with a SUBS instruction (this 
restores the program counter to the correct point and updates the status flags), 
restoring the PC (from restored R14_mon) and CPSR (from SPSR_mon). So, 
the return point in the non-secure domain is the instruction following the 
previously executed SMI in thread 1. 

6. Thread 1 executes until the end, then gives the hand back to the scheduler. 

In other embodiments it may be desired not to allow SMIs to occur in user 

modes. 

Secure World Entry 
Reset 



When a hardware reset occurs, the MMU is disabled and the ARM core 
(processor) branches to secure supervisor mode with the S bit set. Once the secure 
25 boot is terminated an SMI to go to monitor mode may be executed and the monitor 
can switch to the OS in non-secure world (non-secure svc mode) if desired. If it is 
desired to use a legacy OS this can simply boo in secure supervisor mode and ignore 
the secure state. 

30 SMI INSTRUCTION 

This instruction (a mode switching software interrupt instruction) can be called 
from any non-secure modes in the non-secure domain (as previously mentioned it may 
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be desired to restrict SMIs to privileged modes), but the target entry point determined 
by the associated vector is always fixed and within monitor mode. Its up to the SMI 
handler to branch to the proper secure function that must be run (e.g. controlled by an 
operand passed with the instruction). 

Passing parameters from non-secure world to secure world can be performed 
using the shared registers of the register bank within a Figure 6 type register bank. 

When a SMI occurs in non-secure world, the ARM core may do the following 
actions in hardware: 

Branch to SMI vector (in secure memory access is allowed since you will now 
be in monitor mode) into monitor mode 

Save PC into R14_mon and CPSR into SPSR_mon 
Set the S bit using the monitor program 

Start to execute secure exception handler in monitor mode (restore/save context 
in case of multi-threads) 

Branch to secure user mode (or another mode, like svc mode) to execute the 
appropriate function 

IRQ and FIQ are disabled while the core is in monitor mode (latency is 
increased) 

Secure World Exit 

There are two possibilities to exit secure world: 

The secure function is finished and we return into previous non-secure mode 
that had called this function. 

The secure function is interrupted by a non-secure exception (e.g. 
IRQ/FIQ/SMI). 



Normal End of Secure Function 
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The secure function terminates normally and we need to resume an application 
in the non-secure world at the instruction just after the SMI. In the secure user mode, i 
C SMF instruction is performed to return to monitor mode with the appropriate 
parameters corresponding to a 'return from secure world' routine. At this stage, the 
registers are flushed to avoid leakage of data between non-secure and secure worlds, 
then non-secure context general purpose registers are restored and non-secure banked 
registers are updated with the value they had in non-secure world. R14_mon and 
SPSR_mon thus get the appropriate values to resume the non-secure application after 
the SMI, -by-executing a 'MOVS PC, R14' instruction. 

Exit of Secure Function Due to a Non-Secure Exception 

In this case, the secure function is not finished and the secure context must be 
saved before going into the non-secure exception handler, whatever the interrupts are 
that need to be handled. 

Secure Interrupts 

There are several possibilities for secure interrupts. 

Two possible solutions are proposed which depend on: 
What kind of interrupt it is (secure or non-secure) 

What mode the core is in when the IRQ occurs (either in secure or in non- 
secure world) 

Solution One 

In this solution, two distinct pins are required to support secure and non-secure 
interrupts. 

While in Non Secure world, if 

an IRQ occurs, the core goes to IRQ mode to handle this interrupt as in ARM 
cores such as the ARM7 
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a SIRQ occurs, the core goes to monitor mode to save non-secure context and 
then to a secure IRQ handler to deal with the secure interrupt. 

While in Secure world, if 
5 - an SIRQ occurs, the core goes to the secure IRQ handler. The core does not 
leave the secure world 

an IRQ occurs, the core goes to monitor mode where secure context is saved, 
then to a non-secure IRQ handler to deal with this non-secure interrupt. 

1 0 In ot her words, when an interrupt that does not belong to the current world 

occurs, the core goes directly to monitor mode, otherwise it stays in the current world 
(see Figure 10). 

IRQ Occurring in Secure World 

15 

See Figure 1 1 A: 



1 . The scheduler launches thread 1 . 

2. Thread 1 needs to perform a secure function => SMI secure call, the core enters 
20 monitor mode. Current PC and CPSR are stored in R14_mon and SPSR_mon, 

IRQ/FIQ are disabled. 

3. The monitor handler (program) does the following tasks: 
The S bit is set. 

Saves at least R14_mon and SPSR_mon in a stack (and possibly other registers 
25 are also Pushed) so that non-secure context cannot be lost if an exception 

occurs .whilst the secure application is running. 

Checks there is a new thread to launch: secure thread 1 . A mechanism (via 
thread ID table) indicates that thread 1 is active in the secure world. 
Secure application can then start in the secure user mode. IRQ/FIQ are then re- 
30 enabled. 
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4. An IRQ occurs while secure thread 1 is running. The core jumps directly to 
monitor mode (specific vector) and stores current PC in R14_mon and CPSR in 
SPSR_mon in monitor mode, (IRQ/FIQ are then disabled). 

5. Secure context must be saved, previous non-secure context is restored. The 
5 monitor handler may be to IRQ mode to update R14_irq/SPSR_irq with appropriate 

values and then passes control to a non-secure IRQ handler. 

6. The IRQ handler services the IRQ, then gives control back to thread 1 in the 
non-secure world. By restoring SPRSJrq and R14_irq into the CPSR and PC, thread 
1 is now pointing onto the SMI instruction that has been interrupted. 

1 0 7. The SMI instruction is re-executed (same instruction as 2). 

8. The monitor handler sees this thread has previously been interrupted, and 
' restores the thread 1 context. It then branches to secure thread 1 in user mode, 

pointing onto the instruction that has been interrupted. 

9. Secure thread 1 runs until it finishes, then branches onto the 'return from 
1 5 secure' function in monitor mode (dedicated SMI). 

10. The 'return from secure' function does the following tasks: 

indicates that secure thread 1 is finished (i.e., in the case of a thread ID table, 
remove thread 1 from the table). 

restore from stack non-secure context and flush required registers, so that no 
20 secure data can be read once a return is made to non-secure world. 

branches back to the non-secure world with a SUBS instruction, restoring the 
PC (from restored R14_mon) and CPSR (from SPSR_mon). So, the return 
point in the non-secure world should be the instruction following the 
previously executed SMI in thread 1 . 
25 11. Thread 1 executes until the end, then gives control back to the scheduler. 

SIRQ Occurring in Non-Secure World 

See Figure 11B: 
1 . The schedule launches thread 1 
30 2. A SIRQ occurs while secure thread 1 is running. The core jumps directly to 
monitor mode (specific vector) and stores current PC in R14_mon and CPSR in 
SPSR_mon in monitor mode, IRQ/FIQ are then disabled. 
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3. Non-Secure context must be saved, then the core goes to a secure IRQ handler. 

4. The IRQ handler services the SIRQ, then gives control back to the monitor 
mode handler using an SMI with appropriate parameters. 

5. The monitor handler restores non-secure context so that a SUBS instruction 
makes the core return to the non-secure world and resumes the interrupted thread 1. 

6. Thread 1 executes until the end, then gives the hand back to the scheduler. 

The mechanism of Figure 1 1A has the advantage of providing a deterministic 
way to enter secure world. However, there are some problems associated with 
interrupt priority: e.g. while a SIRQ is running in secure interrupt handler, a non- 
secure IRQ with higher priority may occur. Once the non-secure IRQ is finished, there 
is a need to recreate the SIRQ event so that the core can resume the secure interrupt. 



Solution Two 

In this mechanism (See Figure 12) two distinct pins, or only one, may support 
secure and non-secure interrupts. Having two pins reduces interrupt latency. 



While in Non Secure world, if 

an IRQ occurs, the core goes to IRQ mode to handle this interrupt like in 
ARM7 systems 

a SIRQ occurs, the core goes to an IRQ handler where an SMI instruction will 
25 make the core branch to monitor mode to save non-secure context and then to a 

secure IRQ handler to deal with the secure interrupt. 
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While in a Secure world, if 

a SIRQ occurs, the core goes to the secure IRQ handler. The core does not 
leave the secure world 



22 



an IRQ occurs, the core goes to the secure IRQ handler where an SMI 
instruction will make the core branch to monitor mode (where secure context is 
saved), then to a non-secure IRQ handler to deal with this non-secure interrupt. 

IRQ Occurring In Secure World 

See Figure 13 A: 

1 . The schedule launches thread 1 . 

2. Thread 1 needs to perform a secure function => SMI secure call, the core enters 
monitor mode. Current PC and CPSR are stored in R14_mon and SPSR_mon, 
IRQ/FIQ are disabled. 

3. The monitor handler does the following tasks: 
The S bit is set. 

Saves at least R14_mon and SPSR_mon in a stack (eventually other registers) 
so that non-secure context cannot be lost if an exception occurs whilst the 
secure application is running. 

Checks there is a new thread to launch: secure thread 1. A mechanism (via 
thread ID table) indicates that thread 1 is active in the secure world. 
Secure application can then start in the secure user mode. IRQ/FIQ are re- 
enabled. 

4. An IRQ occurs while secure thread 1 is running. The core jumps directly to 
secure IRQ mode. 

5. The core stores current PC in R14_irq and CPSR in SPSRJrq. The IRQ 
handler detects this is a non-secure interrupt and performs a SMI to enter monitor 
mode with appropriate parameters. 

6. Secure context must be saved, previous non-secure context is restored. The 
monitor handler knows where the SMI came from by reading the CPSR. It can also go 
to IRQ mode to read R14_irq/SPSR_irq to save properly secure context. It can also 
save in these same registers the non-secure context that must be restored once the IRQ 
routine will be finished. 
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7. The IRQ handler services the IRQ, then gives control back to thread 1 in the 
non-secure world. By restoring SPRSJrq and R14_irq into the CPSR and PC, the 
core is now pointing onto the SMI instruction that has been interrupted. 

8. The SMI instruction is re-executed (same instruction as 2). 

9. The monitor handler sees this thread has previously been interrupted, and 
restores the thread 1 context. It then branches to secure thread 1 in user mode, 
pointing to the instruction that has been interrupted. 

10. Secure thread 1 runs until it finishes, then branches onto the 'return from 
secure'; function in monitor mode (dedicated SMI). 

1 1 . The 'return from secure' function does the following tasks: 

indicates that secure thread 1 is finished (i.e., in the case of a thread ID table, 
remove thread 1 from the table). 

restores from stack non-secure context and flushes required registers, so that no 
secure information can be read once we return in non-secure world, 
branches back to the non-secure world with a SUBS instruction, restoring the 
PC (from restored R14_mon) and CPSR (from SPSR_mon). The return point 
in the non-secure world should be the instruction following the previously 
executed SMI in thread 1 . 
11. Thread 1 executes until the end, then gives the hand back to the scheduler. 

SIRQ Occurring in Non-Secure World 



See Figure 13B: 



25 1 . The schedule launches thread 1 . 

2. A SIRQ occurs while secure thread 1 is running. 

3. The core jumps directly irq mode and stores current PC in R14_irq and CPSR 
in SPSR_irq. IRQ is then disabled. The IRQ handler detects this is a SIRQ and 
performs a SMI instruction with appropriate parameters, 

30 4. Once in monitor mode, non-secure context must be saved, then the core goes to 
a secure IRQ handler. 
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5. The secure IRQ handler services the SIRQ service routine, then gives control 
back to monitor with SMI with appropriate parameters. 

6. The monitor handler restores non-secure context so that a SUBS instruction 
makes the core returns to non-secure world and resumes the interrupted IRQ handler. 

7. The IRQ handler may then return to the non-secure thread by performing a 
SUBS. 

8. Thread 1 executes until the end, then gives control back to the scheduler. 
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With the mechanism of Figure 12, there is no need to recreate the SIRQ event 
in the case of nested interrupts, but there is no guarantee that secure interrupts will be 
performed. 



Exception Vectors 
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At least two physical vector tables are kept (although from a virtual address 
point of view they may appear as a single vector table), one for the non-secure world 
in non-secure memory, the one for the secure world in secure memory (not accessible 
from non-secure world). The different virtual to physical memory mappings used in 
the secure and non-secure worlds effectively allow the same virtual memory addresses 
to access different vector tables stored in physical memory. The monitor mode may 
always use flat memory mapping to provide a third vector table in physical memory. 
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If the interrupts follow the Figure 12 mechanism, there would be the following 
vectors shown in Figure 14 for each table. This vector set is duplicated in both secure 
and non-secure memory. 



Exception 


Vector Offset 


Corresponding Mode 


Reset 


0x00 


Supervisor Mode (S bit 
set) 


Undef 


0x04 


Monitor mode/Undef 
mode 
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SWI 


0x08 


Supervisor mode/Monitor 
mode 


Prefetch Abort 


OxOC 


Abort mode/Monitor mode 


Data Abort 


0x10 


Abort mode/Monitor 
Mode 


SMI 


0x14 


Undef mode/Monitor 
mode 


IRQ/SIRQ 


0x18 


IRQ mode 


FIQ 


OxlC 


FIQ mode 



NB. The Reset entry is only in the secure vector table. When a Reset is 
performed in non secure world, the core hardware forces entry of supervisor mode and 
setting of the S bit so that the Reset vector can be accessed in secure memory. 

Figure 15 illustrates three exception vector tables respectively applicable to a 
secure mode, a non-secure mode and the monitor mode. These exception vector tables 
may be programmed with exception vectors in order to match the requirements and 
characteristics of the secure and non-secure operating systems. Each of the exception 
vector tables may have an associated vector table base address register within CP 15 
storing a base address pointing to that table within memory. When an exception 
occurs the hardware will reference the vector table base address register corresponding 
to the current state of the system to.determine the base address of the vector table to be 
used. Alternatively, the different virtual to physical memory mappings applied in the 
different modes may be used to separate the three different vector table sstored at 
different physical memory addresses. As illustrated in Figure 16, an exception trap 
mask register is provided in a system (configuration controlling) coprocessor (CP 15) 
associated with the processor core. This exception trap mask register provides flags 
associated with respective exception types. These flags indicate whether the hardware 
should operate to direct processing to either the vector for the exception concerned 
within its current domain or should force a switch to the monitor mode (which is a 
type of secure mode) and then follow the vector in the monitor mode vector table. The 
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exception trap mask register (exception control register) is only writable from the 
monitor mode. It may be that read access is also prevented to the exception trap mask 
register when in a non-secure mode. It will be seen that the exception trap mask 
register of Figure 16 does not include a flag for the reset vector as the system is 
configured to always force this to jump to the reset vector in the secure supervisor 
mode as specified in the secure vector table in order to ensure a secure boot and 
backwards compatibility. It will be seen that in Figure 15, for the sake of 
completeness, reset vectors have been shown in the vector tables other than the secure 
supervisor mode secure vector table. 

Figure 16 also illustrates that the flags for the different exception types within 
the exception trap mask register are programmable, such as by the monitor program 
during secure boot. Alternatively, some or all of the flags may in certain 
implementations be provided by physical input signals, e.g. the secure interrupt flag 
SIRQ may be hardwired to always force monitor mode entry and execution of the 
corresponding monitor mode secure interrupt request vector when a secure interrupt 
signal is received. Figure 16 illustrates only that portion of the exception trap register 
concerned with non-secure domain exceptions, a similar set of programmable bits will 
be provided for secure domain exceptions. 

Whilst it will be understood from the above that at one level the hardware acts 
to either force an interrupt to be serviced by the current domain exception handler or 
the monitor mode exception handler depending upon the exception control register 
flags, this is only the first level of control that is applied. As an example, it is possible 
for an exception to occur in the secure mode, the secure mode exception vector to be 
followed to the secure mode exception handler, but this secure mode exception handler 
then decide that the exception is of a nature that it is better dealt with by the non- 
secure exception handler and accordingly utilise an SMI instruction to switch to the 
non-secure mode and invoke the non-secure exception handler. The converse is also 
possible where the hardware might act to initiate the non-secure exception handler, but 
this then execute instructions which direct processing to the secure exception handler 
or the monitor mode exception handler. 
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Figure 17 is a flow diagram schematically illustrating the operation of the 
system so as to support another possible type of switching request associated with a 
new type of exception. At step 98 the hardware detects any instruction which is 
attempting to change to monitor mode as indicate in a current program status register 
(CPSR). When such an attempt is detected, then a new type of exception is triggered, 
: this being referred to herein as a CPSR violation exception. The generation of this 
CPSR violation exception at step 100 results in reference to an appropriate exception 
vector within the monitor mode and the monitor program is run at step 102 to handle 
the CPSR violation exception. 

It will be appreciated that the mechanisms for initiating a switch between 
secure domain and non-secure domain discussed in relation to Figure 17 may be 
provided in addition to support for the SMI instruction previously discussed. This 
exception mechanism may be provided to respond to unauthorised attempts to switch 
mode as all authorised attempts should be made via an SMI instruction. Alternatively, 
such a mechanism may be legitimate ways to switch between the secure domain and 
the non-secure domain or may be provided in order to give backwards compatibility 
with existing code which, for example, might seek to clear the processing status 
register as part of its normal operation even though it was not truly trying to make an 
unauthorised attempt to switch between the secure domain and the non-secure domain. 

A description of an alternative embodiment(s) of the present technique 
considered from a programmer's model view is given below in relation to Figures 18 
to 20 as follows: 

In the following description, we will use the following terms that must be 
understood in the context of an ARM processor as designed by ARM Limited, of 
Cambridge, England. 

S bit : Secure state bit, contained in a dedicated CP 15 register. 
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'Secure/Non-Secure state'. This state is defined by the S bit value. It indicates 
whether the core may access the Secure world (when it is in Secure state, i.e. 
S=l) or is restricted to the Non-secure world only (S=0). Note that the Monitor 
mode (see further) overrides the S bit status. 

'Non-Secure World' groups all hardware/software accessible to non-secure 
applications that do not require security. 

'Secure World' groups all hardware/software (core, memory...) that is only 
accessible when we execute secure code. 
- Monitor mode: new mode that is responsible for switching the core between 

the Secure and Non-secure state. 

As a brief summary 

The core can always access the Non-secure world. 

The core can access the Secure world only when it is in Secure state or 

Monitor mode. 

SMI: Software Monitor Interrupt: New instruction that will make the core enter 
the Monitor mode through a dedicated SMI exception vector.Thread ID': is the 
identifier associated to each thread (controlled by an OS). For some types of 
OS where the OS runs in non-secure world, each time a secure fimction is 
called, it will be necessary to pass as a parameter the current thread ID to link 
the secure function to its calling non-secure application. The secure world can 
thus support multi-threads. 

Secure Interrupt defines an interrupt generated by a Secure peripheral. 
Programmer's model 

Carbon Core Overview 

The concept of the Carbon architecture, which is the term used herein for processors 
using the present techniques, consists in separating two worlds, one secure and 
one non-secure. The secure world must not leak any data to non-secure world. 
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In the proposed solution, the secure and non-secure states will share the same 
(existing) register bank. As a consequence, all current modes present in ARM cores 
(Abort, Undef, Irq, User, ...) will exist in each state. 

The core will know it operates in secure or non-secure state thanks to a new 
state bit, the S (secure) bit, instantiated in a dedicated CP 15 register. 

Controlling which instruction or event is allowed to modify the S bit, i.e. to 
' change froirrone state to the other," is a key feature of the security of the system. The 
current solution proposes to add a new mode, the Monitor mode, that will "supervise" 
switching between the two states. The Monitor mode, by writing to the appropriate 
CP 15 register, would be the only one allowed to alter the S bit. 

Finally, we propose to add some flexibility to the exception handling. All 
exceptions, apart from the reset, would be handled either in the state where they 
happened, or would be directed to the Monitor mode. This would be left configurable 
thanks to a dedicated CP 15 register. 

The details of this solution are discussed in the following paragraphs. 

Processor state and modes 

Carbon new features 

Secure or Non-secure state (S bit) 

One major feature of the Carbon core is the presence of the S bit, which 
indicates whether the core is in a Secure (S=l) or Non-secure (S=0) state. When in 
Secure state, the core would be able to access any data in the Secure or Non-secure 
worlds. When in Non-Secure state, the core would be restricted to the Non-secure 
world only. 

The only exception to this rule concerns the Monitor mode, which overrides the 
S bit information. Even when S=0, the core will perform Secure privileged accesses 
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when it is in Monitor mode. See next paragraph, Monitor mode, for further 
information 

The S bit can only be read and written in Monitor mode. Whatever the S bit 
5 value, if any other mode tries to access it, this will be either ignored or result in an 
Undefined exception. 

All exceptions, apart from Reset, have no effect on the Secure state bit. On 
Reset, ~the S bit will be set, and the core will start in Supervisor mode. Refer to the 
10 boot section for detailed information. 

Secure/Nonsecure states are separate and operate independently of the 
ARM/Thumb/Java states. 

15 Monitor mode 

One other important feature of the Carbon system is the creation of a new 
mode, the Monitor mode. This will be used to control the core switching between the 
Secure and Non-secure states . It will always be considered as a secure mode, i.e. 
whatever the value of the S bit, the core will always perform Secure Privileged 
20 accesses to the external world when it is in Monitor mode. 

Any Secure privileged mode (i.e. privileged modes when S=l) would be able 
to switch to Monitor mode by simply writing the CPSR mode bits (MSR, MOVS, or 
equivalent instruction). However, this would be forbidden in any Non-secure mode or 
25 Secure user mode. If this ever happens, the instruction would be ignored or cause an 
exception. 

There may be a need for a dedicated CPSR violation exception. This exception 
would be raised by any attempt to switch to Monitor mode by directly writing the 
30 CPSR from any Non-secure mode or Secure user mode. 
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All exceptions except Reset are in effect disabled when Monitor mode is 

active: 

• all interrupts are masked; 

• all memory exceptions are either ignored or cause a fatal exception. 
5 • undefined/SWI/SMI are ignored or cause a fatal exception. 

When entering Monitor mode, the interrupts are automatically disabled and the 
system monitor should be written such that none of the other types of exception can 
happen while the system monitor is running. 

10 

Monitor mode needs to have some private registers. This solution proposes 
that we only duplicate the minimal set of registers, i.e R13 (sp_mon), R14 (lr_mon) 
and SPSR (spsr_mon). 

15 In Monitor mode, the MMU will be disabled (flat address map) as well as the 

MPU or partition checker (the Monitor mode will always perform secure privileged 
external accesses). However, specially programmed MPU region attributes 
(cacheability, ...) would still be active. 

20 New instruction 

This proposal requires adding one new instruction to the existing ARM 
instruction set. 

The SMI (Software Monitor Interrupt) instruction would be used to enter the 
25 Monitor mode, branching at a fixed SMI exception vector. This instruction would be 
mainly used to indicate to the Monitor to swap between the Non-secure and Secure 
State. 

As an alternative (or in addition) it would be possible to add a new instruction 
30 to allow the Monitor mode to save/restore the state of any other mode onto/from the 
Monitor stack to improve context switching performance. 



Processor Modes 

As discussed in the previous paragraph, only one new mode is added in the 
core, the Monitor mode. All existing modes remain available, and will exist both in 
the secure and non-secure states. 

In fact, Carbon users will see the structure illustrated in Figure 18. 
Processor registers 

This embodiment proposes that the secure and the non-secure worlds share the 
same register bank. This implies that, when switching from one world to the other 
through the Monitor mode, the system monitor will need to save the first world 
context, and create (or restore) a context in the second world. 

Passing parameters becomes an easy task: any data contained in a register in 
the first world will be available in the same register in the second world once the 
system monitor has switched the S bit. 

However, apart from a limited number of registers dedicated to passing 
parameters, which will need to be strictly controlled, all other registers will need to be 
flushed when passing from Secure to Non-secure state in order to avoid any leak of 
Secure data. This will need to be ensured by the Monitor kernel. 

The possibility of implementing a hardware mechanism or a new instruction to 
directly flush the registers when switching from Secure to Non-secure state is also a 
possibility. 

Another solution proposed involves duplicating all (or most of) the existing 
register bank, thus having two physically separated register banks between the Secure 
and Non-secure state. This solution has the main advantage of clearly separating the 
secure and non-secure data contained in the registers. It also allows fast context 
switching between the secure and non-secure states. However, the drawback is that 
passing parameters through registers becomes difficult, unless we create some 
dedicated instructions to allow the secure world access the non-secure registers 
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Figure 19 illustrates the available registers depending on the processor mode. 
Note that the processor state has no impact on this topic. 

5 Exceptions 

Secure interrupts 
Current Solution 

It is currently proposed to keep the same interrupt pins as in the current cores, 
i.e. IRQ and FIQ. In association with the Exception Trap Mask register (defined later 
10 in the document), there should be sufficient flexibility for any system to implement 
and handle different kind of interrupts. 

VIC enhancement 

We could enhance the VIC (Vectored Interrupt Controller) in the following 
15 way: the VIC may contain one Secure information bit associated to each vectored 
address. This bit would be programmable by the Monitor or Secure privileged modes 
only. It would indicate whether the considered interrupt should be treated as Secure, 
and thus should be handled on the Secure side. 

20 We w ould also add two new Vector Address registers, one for all Secure 

Interrupts happening in Non-Secure state, the other one for all Non-Secure interrupts 
happening in Secure state. 

The S bit information contained in CP 15 would be also available to the VIC as 
25 a new VIC input. 

The following table summarizes the different possible scenarios, depending on 
the status of the incoming interrupt (Secure or Non-secure, indicated by the S bit 
associated to each interrupt line) and the state of the core (S bit in CP 15 = S input 
30 signal on the VIC). 
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Core in secure state 
(CP15 - S=l) 


Core in Non-secure state 
(CP15 - S=0) 


Secure Interrupt 

<> 


No need to switch between worlds. 
The VIC directly presents to th< 
core the Secure address associatec 
to the interrupt line. The core 
simply has to branch at this address 
where it should find the associatec 
ISR. 


The VIC has no Vector associated 
tto this interrupt in the Non-secure 
1 domain. It thus presents to the core 
^the address contained in the Vector 
>address register dedicated to all 
1 Secure interrupts occurring in Non- 
secure world. The core, still in Non- 
secure world, then branches to this 
address, where it should find an 
SMI instruction to switch to Secure 
world. Once in Secure world, it 
would be able to have access to the 
correct ISR. 


Non-Secure 
Interrupt 

< 
< 

<; 

s 
\ 
a 


The VIC has no Vector associated 
to this interrupt in the Secure 
domain. It thus presents to the core 
the address contained in the Vector 
address register dedicated to all 
Von-secure interrupts occurring in 
Secure world. The core, still in< 
Secure world, then branches to this 
iddress wherp it chmilH f%r\A 
SMI instruction to switch to Non- 
ecure world. Once in Non-secure 
vorld, it would be able to have 
tccess to the correct ISR. 


No need to switch between worlds. 
The VIC directly presents to the 
core the Non-secure address 
associated to the interrupt line. The 
core simply has to branch at this 
address where it should find the 
associated Non-secure ISR. 



Exception handling configurability 
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In order to improve Carbon flexibility, a new register, the Exception Trap 
Mask, would be added in CP 15. This register would contain the following bits: 

Bit 0: Undef exception (Non-secure state) 

Bit 1 : S WI exception (Non-secure state) 

5 . - Bit 2: Prefetch abort exception (Non-secure state) 

Bit 3: Data abort exception (Non-secure state) 

Bit 4: IRQ exception . (Non-secure state) 

Bit 5: FIQ exception (Non-secure state) 

Bit 6: SMI exception- (both Non-secure/Secure states) 

10 - Bit 16: Undef exception (Secure state) 

Bit 17: SWI exception (Secure state) 

Bit 18: Prefetch abort exception (Secure state) 

Bit 19: Data abort exception (Secure state) 

Bit 20: IRQ exception (Secure state) 

15 - Bit 21 : FIQ exception (Secure state) 

The Reset exception does not have any corresponding bit in this register. Reset 
will always cause the core to enter the Secure supervisor mode through its dedicated 
vector. 
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If the bit is set, the corresponding exception makes the core enter the Monitor 
mode. Otherwise, the exception will be handled in its corresponding handler in the 
world where it occurred. 



25 This register would only be visible in Monitor mode. Any instruction trying to 

access it in any other mode would be ignored. 

This register should be initialized to a system-specific value, depending upon 
whether the system supports a monitor or not. This functionality could be controlled 
30 by a VIC. 



36 



10 



Exception vectors tables 

As there will be separate Secure and Non-secure worlds, we will also need 
separate Secure and Non-secure exception vectors tables. 

Moreover, as the Monitor can also trap some exceptions, we may also need a 
third exception vectors table dedicated to the Monitor. 

The following table summarizes those three different exception vectors tables: 
In non-secure memory: 



Address 

0x00 


Exception 


Mode 


Automatically accessor! whpn 


0x04 


Undef 


Undef 


Undefined instruction executed when core is 
in Non-Secure state and Exception Trap 
Mask reg [Non-secure Undef]=0 


0x08 


SWI 


Supervisor 


SWI instruction executed when core is in 
Non-Secure state and Exception Trap Mask 
reg [Non-secure SWI]=0 


OxOC 


Prefetch Abort 


Abort 


Aborted instruction when core is in Non- 
Secure state and Exception Trap Mask reg 
[Non-secure PAbort]=0 


0x10 


Data Abort 


Abort 


Aborted data when core is in Non-Secure 
state and Exception Trap Mask reg [Non- 
secure D Abort] =0 


0x14 


Reserved 






0x18 


IRQ 


[RQ 


IRQ pin asserted when core is in Non- 
Secure state and Exception Trap Mask reg 
[Non-secure IRQ]=0 


0x1 C 


FIQ 


< 


FIQ pin asserted when core is in Non- 
Secure state and Exception Trap Mask reg 
Non-secure FIQ]=0 



37 



In secure memory: 



Address 



0x00 



0x04 



0x08 



Undef 



Exception 

Reset* 



Mode 



Undef 



SWI 



Supervisor 



Supervisor 



Reset pin asserted 
Undefined instruction executed when core is 
in Secure state and Exception Trap Mask 
reg [Secure Undef]=0 



SWI instruction executed when core is in 
Secure state and Exception Trap Mask reg 
[Secure SWI]=0 



Automatically accessed when 



OxOC 



0x10 



0x14 



0x18 



OxlC 



Prefetch Abort 



Abort 



Aborted instruction when core is in Secure 
state and Exception Trap Mask reg [Secure 
Abort] =0 



Data Abort 



Abort 



Aborted data when core is in Secure state 
and Exception Trap Mask reg [Secure 
Dabort]=0 



Reserved 



RQ 



IRQ 



•IQ 



RQ pin asserted when core is in Secure 
state and Exception Trap Mask reg [Secure 
RQ]=0 



FIQ 



FIQ pin asserted when core is in Secure 
state and Exception Trap Mask reg [Secure 
FIQ]=0 



* Refer to "Boot" section for further explanation on the Reset mechanism 
5 In Monitor memory (flat mapping): 



Address 


Exception 


Mode 


Automatically accessed when 


0x00 









38 



0x04 


Undef 


Monitor 


Undefined instruction executed when 
core is in Secure state and Exception Trap 
Mask reg [Secure Undef]=l 
core is in Non-secure state and Exception 
Trap Mask reg [Non-secure Undef]=l 


0x08 


SWI 


Monitor 


SWI instruction executed when 

core is in Secure state and Exception Trap 

Mask reg [Secure S WI]=1 

core is in Non-secure state and Exception 

Trap Mask reg [Non-secure SWI]=1 


OxOC 


Prefetch Abort 


Monitor 


Aborted instruction when 

core is in Secure state and Exception Trap 

Mask reg [Secure IAbort]=l 

core is in Non-secure state and Exception 

Trap Mask reg [Non-secure Iabort]=l 


0x10 


Data Abort 


Monitor 


Aborted data when 

core is in Secure state and Exception Trap 
Mask reg [Secure PAbort]=l 
core is in Non-secure state and Exception 
Trap Mask reg [Non-secure Pabort]=l 


0x14 


SMI 


Monitor 




0x18 


IRQ 


Monitor 


- IRQ pin asserted when 

core is in Secure state and Exception Trap 

Mask reg [Secure IRQ]=1 

core is in Non-secure state and Exception 

Trap Mask reg [Non-secure IRQ]=1 


")x1C 




VIonitor 


- FIQ pin asserted when 

sore is in Secure state and Exception Trap 

Mask reg [Secure FIQ]=1 

:ore is in Non-secure state and Exception 

Trap Mask reg [Non-secure FIQ]=1 
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In Monitor mode, the exceptions vectors may be duplicated, so that each 
exception will have two different associated vector: 

One for the exception arising in Non-sebure state 

One for the exception arising in Secure state 

This may be useful to reduce the exception latency, because the monitor kernel 
does not have any more the need to detect the originating state where the exception 
occurred. 

Note that this feature may be limited to a few exceptions, the SMI being one of 
the most suitable candidates to improve the switching between the Secure and Non- 
secure states. 

Switching between worlds 

When switching between states, the Monitor mode must save the context of the 
first state on its Monitor stack, and restore the second state context from the Monitor 
stack. 

The Monitor mode thus needs to have access to any register of any other 
modes, including the private registers (rl4, SPSR, ..). 

To handle this, the proposed solution consists in giving any privilege mode in 
Secure state the rights to directly switch to Monitor mode by simply writing the CPSR. 

With such a system, switching between worlds is performed as follows: 
enter Monitor mode 
set the S bit 

switch to supervisor mode - save the supervisor registers on the MONITOR 
stack (of course the supervisor mode will need to have access to the Monitor 
stack pointer, but this can be easily done, for example by using a common 
register (RO to R8)) 

switch to System mode - save the registers (=same as the user mode) on the 
Monitor stack 
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IRQ registers on the Monitor stack 
for all modes 

Once all private registers of all modes are saved, revert to Monitor mode with a 
simple MSR instruction (= simply write Monitor value in the CPSR mode 
field) 

The other solutions have also been considered: 

Add a new instruction that would allow the Monitor to save other 

modes'private registers on its own stack. 

Implement the Monitor as a new "state", i.e. being able to be in Monitor state 
(to have the appropriate access rights) and in IRQ (or any other) mode, to see 
the IRQ (or any other) private registers. 

Boot mechanism 

The boot mechanism must respect the following features: 
Keep compatibility with legacy OSes. 

Boot in most privileged mode to ensure the security of the system. 
As a consequence, Carbon cores will boot in Secure Supervisor mode. 
The different systems will then be: 

For systems wanting to run legacy OSes, the S bit is not taken into account and 
the core will just see it boots in Supervisor mode. 

For systems wanting to use the Carbon features, the core boots in Secure 
privileged mode which should be able to configure all secure protections in the 
system (potentially after swapping to Monitor mode) 



Basic Scenario (See Figure 20) 

Thread 1 is running in non-secure world (S bit 

This thread needs to perform a secure function 



= 0) 

=> SMI instruction. 
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The SMI instruction makes the core enter the Monitor mode through a 
dedicated non-secure SMI vector. 

LR_mon and SPSR_mon are used to save the PC and CPSR of the non secure 
mode. 

The S bit remains unchanged (i.e. non-secure state). 

The monitor kernel saves the non-secure context on the monitor. 

It also pushes LR_mon and SPSR_mon. 

The monitor kernel then changes the "S" bit by writing into the CP 15 register. 
It must keep track that a "secure thread 1" will be started in the secure world 
(e.g. by updating a Thread ID table). 

Finally, it exits the monitor mode and switches to secure supervisor mode 

(MOVS instruction after having updated LR_mon and SPSR_mon?). 

The secure kernel dispatches the application to the right secure memory 

location, then switches to user mode (e.g. using a MOVS). 

The secure function in executed in secure user mode. Once finished, it calls an 

"exit" function by performing an appropriate SWI. 

The SWI instruction makes the core enter the secure svc mode through a 
dedicated SWI vector, that in turn performs the "exit" function. This "exit" 
function ends with an "SMI" to switch back to monitor mode. 
The SMI instruction makes the core enter the monitor mode through a 
dedicated secure SMI vector. 

LR_mon and SPSR_mon are used to save the PC and CPSR of the Secure svc 
mode. 

The S bit remains unchanged (i.e. Secure State). 

The monitor kernel registers the fact that secure thread 1 is finished (removes 
the secure thread 1 ID from the thread ID table?). 

It then changes the "S" bit by writing into the CP 15 register, returning to non- 
secure state. 

The monitor kernel restores the non-secure context from the monitor stack. 
It also load the LRmon and CPSR_mon previously saved in step 2. 
Finally, it exits monitor mode with a SUBS, that will make the core return in 
non-secure user mode, on the instruction 



7. Thread 1 can resume normally. 



Figure 21 illustrates in more detail the operation of the memory management 
logic 30 of one embodiment of the present invention. The memory management logic 
consists of a Memory Management Unit (MMU) 200 and a Memory Protection Unit 
(MPU) 220. Any memory access request issued by the core 1 0 that specifies a virtual 
address will be passed over path 234 to the MMU 200, the MMU 200 being 
responsible for performing predetermined access control functions, more particularly 
for determining the physical address corresponding to that virtual address, and for 
resolving access permission rights and determining region attributes. 

The memory system of the data processing apparatus consists of secure 
memory and non-secure memory, the secure memory being used to store secure data 
that is intended only to be accessible by the core 10, or one or more other master 
devices, when that core or other device is operating in a secure mode of operation, and 
is accordingly operating in the secure domain. 

In the embodiment of the present invention illustrated in Figure 21, the policing 
of attempts to access secure data in secure memory by applications running on the core 
10 in non-secure mode is performed by the partition checker 222 within the MPU 220, 
the MPU 220 being managed by the secure operating system, also referred to herein as 
the secure kernel. 

In accordance with preferred embodiments of the present invention a non- 
secure page table 58 is provided within non-secure memory, for example within a non- 
secure memory portion of external memory 56, and is used to store for each of a 
number of non-secure memory regions defined within that page table a corresponding 
descriptor. The descriptor contains information from which the MMU 200 can derive 
access control information required to enable the MMU to perform the predetermined 
access control functions, and accordingly in the embodiment described with reference 
to Figure 21 will provide information about the virtual to physical address mapping, 
the access permission rights, and any region attributes. 
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Furthermore, in accordance with the preferred embodiments of the present 
invention, at least one secure page table 58 is provided within secure memory of the 
memory system, for example within a secure part of external memory 56, which again 
5 for a number of memory regions defined within the table provides an associated 
descriptor. When the processor is operating in a non-secure mode, the non-secure 
page table will be referenced in order to obtain relevant descriptors for use in 
managing memory accesses, whilst when the processor is operating in secure mode, 
descriptors from the secure page table will be used. 

10 

The retrieval of descriptors from the relevant page table into the MMU 
proceeds as follows. In the event that the memory access request issued by the core 1 0 
specifies a virtual address, a lookup is performed in the micro-TLB 206 which stores 
for one of a number, of virtual address portions the corresponding physical address 

15 portions obtained from the relevant page table. Hence, the micro-TLB 206 will 
compare a certain portion of the virtual address with the corresponding virtual address 
portion stored within the micro-TLB to determine if there is a match. The portion 
compared will typically be some predetermined number of most significant bits of the 
virtual address, the number of bits being dependent on the granularity of the pages ' 

20 within the page table 58. The lookup performed within the micro-TLB 206 will 
typically be relatively quick, since the micro-TLB 206 will only include a relatively 
few number of entries, for example eight entries 

In the event that there is no match found within the micro-TLB 206, then the 
25 memory access request is passed over path 242 to the main TLB 208 which contains a 
number of descriptors obtained from the page tables. As will be discussed in more 
detail later, descriptors from both the non-secure page table and the secure page table 
can co-exist within the main TLB 208, and each entry within the main TLB has a 
corresponding flag (referred to herein as a domain flag) which is settable to indicate 
30 whether the corresponding descriptor in that entry has been obtained from a secure 
page table or a non-secure page table. In any embodiments where all secure modes of 
operation specify physical addresses directly within their memory access requests, it will 
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be appreciated that there will not be a need for such a flag within the main TLB, as the 
main TLB will only store non-secure descriptors. 

Within the main TLB 208, a similar lookup process is performed to determine 
whether the relevant portion of the virtual address issued within the memory access 
request corresponds with any of the virtual address portions associated with descriptors 
in the main TLB 208 that are relevant to the particular mode of operation. Hence, if 
the core 10 is operating in non-secure mode, only those descriptors within the main 
TLB 208 which have been obtained from the non-secure page table will be checked, 
whereas if the core 10 is operating in secure mode, only the descriptors within the 
main TLB that have been obtained from the secure page table will be checked. 

If there is a hit within the main TLB as a result of that checking process, then 
the access control information is extracted from the relevant descriptor and passed 
back over path 242. In particular, the virtual address portion and the corresponding 
physical address portion of the descriptor will be routed over path 242 to the micro- 
TLB 206, for storage in an entry of the micro-TLB, the access permission rights will 
be loaded into the access permission logic 202, and the region attributes will be loaded 
into the region attribute logic 204. The access permission logic 202 and region 
attribute logic 204 may be separate to the micro-TLB, or may be incorporated within 
the micro-TLB. 



At this point, the MMU 200 is then able to process the memory access request 
since there will now be a hit within the micro-TLB 206. Accordingly, the micro-TLB 
206 will generate the physical address, which can then be output over path 238 onto 
the system bus 40 for routing to the relevant memory, this being either on-chip 
memory such as the TCM 36, cache 38, etc, or one of the external memory units 
accessible via the external bus interface 42. At the same time, the access permission 
logic 202 will determine whether the memory access is allowed, and will issue an 
abort signal back to the core 10 over path 230 if it determines that the core is not 
allowed to access the specified memory location in its current mode of operation. For 
example, certain portions of memory, whether in secure memory or non-secure 
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memory, may be specified as only being accessible by the core when that core is 
operating in supervisor mode, and accordingly if the core 10 is seeking to access such 
a memory location when in, for example, user mode, the access permission logic 202 
will detect that the core 10 does not currently have the appropriate access rights, and 
5 will issue the abort signal over path 230. This will cause the memory access to be 
aborted. Finally, the region attribute logic 204 will determine the region attributes for 
the particular memory access, such as whether the access is cacheable, bufferable, etc, 
and will issue such signals over path 232, where they will then be used to determine 
-whether the data the subject of the memory access request can be cached, for example 
10 within the cache 38, whether in the event of a write access the write data can be 
buffered, etc. 

In the event that there was no hit within the main TLB 208, then the translation 
table walk logic 210 is used to access the relevant page table 58 in order to retrieve the 

15 required descriptor over path 248, and then pass that descriptor over path 246 to the 
main TLB 208 for storage therein. The base address for both the non-secure page 
table and the secure page table will be stored within registers of CP15 34, and the 
current domain in which the processor core 10 is operating, i.e. secure domain or non- 
secure domain, will also be set within a register of CP 15, that domain status register 

20 being set by the monitor mode when a transition occurs between the non-secure 
domain and the secure domain, or vice versa. The content of the domain status register 
will be referred to herein as the domain bit. Accordingly, if a translation table walk 
process needs to be performed, the translation table walk logic 210 will know in which 
domain the core 10 is executing, and accordingly which .base address to use to access 

25 the relevant table. The virtual address is then used as an offset to the base address in 
order to access the appropriate entry within the appropriate page table in order to 
obtain the required descriptor. ■ . 

Once the descriptor has been retrieved by the translation table walk logic 210, 
30 and placed within the main TLB 208, a hit will then be obtained within the main TLB, 
and the earlier described process will be invoked to retrieve the access control 
information, and store it within the micro-TLB 206, the access permission logic 202 
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and the region attribute logic 204. The memory access can then be actioned by the 
MMU 200. 

As mentioned earlier, in preferred embodiments, the main TLB 208 can store 
5 descriptors from both the secure page table and the non-secure page table, but the 
memory access requests are only processed by the MMU 200 once the relevant 
information is stored within the micro-TLB 206. In preferred embodiments, the 
transfer of information between the main TLB 208 and the micro-TLB 206 is 
.monitored bythepartition checker 222 located within the MPU 220 to ensure that, in 
10 the event that the core 10 is operating in a non-secure mode, no access control 
information is transferred into the micro-TLB 206 from descriptors in the main TLB 
208 if that would cause a physical address to be generated which is within secure 
memory. 

15 The memory protection unit is managed by the secure operating system, which 

is able to set within registers of the CP 15 34 partitioning information defining the 
partitions between the secure memory and the non-secure memory. The partition 
checker 222 is then able to reference that partitioning information in order to 
determine whether access control information is being transferred to the micro-TLB 

20 206 which would allow access by the core 1 0 in a non-secure mode to secure memory. 
More particularly, in preferred embodiments, when the core 10 is operating in a non- 
secure mode of operation, as indicated by the domain bit set by the monitor mode 
within the CP 15 domain status register, the partition checker 222 is operable to 
monitor via path 244 any physical address portion seeking to be retrieved into the 

25 micro-TLB 206 from the main TLB 208 and to determine whether the physical address 
that would then be produced for the virtual address based on that physical address 
portion would be within the secure memory. In such circumstances, the partition 
checker 222 will issue an abort signal over path 230 to the core 10 to prevent the 
memory access from taking place. 

30 

It will be, appreciated that in addition the partition checker 222 can be arranged 
to actually prevent that physical address portion from being stored in the micro-TLB 
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206 or alternatively the physical address portion may still be stored within the micro- 
TLB 206, but part of the abort process would be to remove that incorrect physical 
address portion from the micro-TLB 206, for example by flushing the micro-TLB 206. 

5 Whenever the core 10 changes via the monitor mode between a non-secure 

mode and a secure mode of operation, the monitor mode will change the value of the 
domain bit within the CP 15 domain status register to indicate the domain into which 
the processor's operation is changing. As part of the transfer process between 
-domains, the micro-TLB 206 will be flushed and accordingly the first memory access 
1 0 following a transition between secure domain and non-secure domain will produce a 
miss in the micro-TLB 206, and require access information to be retrieved from main 
TLB 208, either directly, or via retrieval of the relevant descriptor from the relevant 
page table. 



15 By the above approach, it will be appreciated that the partition checker 222 will 

ensure that when the core is operating in the non-secure domain, an abort of a memory 
access will be generated if an attempt is made to retrieve into the micro-TLB 206 
access control information that would allow access to secure memory. 

20 If in any modes of operation of the processor core 10, the memory access 

request is arranged to specify directly a physical address, then in that mode of 
operation the MMU 200 will be disabled, and the physical address will pass over path 
236 into the MPU 220. In a secure mode of operation, the access permission logic 224 
and the region attribute logic 226 will perform the. necessary access permission and 

25 region attribute analysis based on the access permission rights and region attributes 
identified for the corresponding regions within the partitioning information registers 
within the CP 15 34. If the secure memory location seeking to be accessed is within a 
part of secure memory only accessible in a certain mode of operation, for example 
secure privileged mode, then an access attempt by the core in a different mode of 

30 operation, for example a secure user mode, will cause the access permission logic 224 
to generate an abort over path 230 to the core in the same way that the access 
permission logic . 202 of the MMU would have produced an abort in such 
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circumstances. Similarly, the region attribute logic 226 will generate cacheable and 
bufferable signals in the same way that the region attribute logic 204 of the MMU 
would have generated such signals for memory access requests specified with virtual 
addresses. Assuming the access is allowed, the access request will then proceed over 
path 240 onto the system bus 40, from where it is routed to the appropriate memory 



unit. 



For a non-secure access where the access request specifies a physical address, 
..the_access..request-will be routed via path 236 into the partition checker 222, which 
will perform partition checking with reference to the partitioning information in the 
CP 15 registers 34 in order to determine whether the physical address specifies a 
location within secure memory, in which event the abort signal will again be generated 
over path 230. 



The above described processing of the memory management logic will now be 
described in more detail with reference to the flow diagrams of Figures 23 and 24. 
Figure 23 illustrates the situation in which the program running on the core 10 
generates a virtual address, as indicated by step 300. The relevant domain bit within 
the CP15 domain status register 34 as set by the monitor mode will indicate whether 
the core is currently running in a secure domain or the non-secure domain. In the 
event that the core is running in the secure domain, the process branches to step 302, 
where a lookup is performed within the micro-TLB 206 to see if the relevant portion of 
the virtual address matches with one of the virtual address portions within the micro- 
TLB. In the event of a hit at step 302, the process branches directly to step 312, where 
the access permission logic 202 performs the necessary access permission analysis. At 
step 314, it is then determined whether there is an access permission violation, and if 
there is the process proceeds to step 316, where the access permission logic 202 issues 
an abort over path 230. Otherwise, in the absence of such an access permission 
violation, the process proceeds from step 314 to step 318, where the memory access 
proceeds. In particular the region attribute logic 204 will output the necessary 
cacheable and bufferable attributes over path 232, and the micro-TLB 206 will issue 
the physical address over path 238 as described earlier. 
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If at step 302 there is a miss in the micro-TLB, then a lookup process is 
performed within the main TLB 208 at step 304 to determine whether the required 
secure descriptor is present within the main TLB. If not, then a page table walk 
5 • process is executed at step 306, whereby the translation table walk logic 210 obtains 
the required descriptor from the secure page table, as described earlier with reference 
to Figure 21. The process then proceeds to step 308, or proceeds directly to step 308 
from step 304 in the event that the secure descriptor was already in the main TLB 208. 

10 At step 308, it is determined that the main TLB now contains the valid tagged 

secure descriptor, and accordingly the process proceeds to step 310, where the micro- 
TLB is loaded with the sub-section of the descriptor that contains the physical address 
portion. Since the core 10 is currently running in secure mode, there is no need for the 
partition checker 222 to perform any partition checking function. 

15 

The process then proceeds to step 312 where the remainder of the memory 
access proceeds as described earlier. 

In the event of a non-secure memory access, the process proceeds from step 
20 300 to step 320, where a lookup process is performed in the micro-TLB 206 to 
determine whether the corresponding physical address portion from a non-secure 
descriptor is present. If it is, then the process branches directly to step 336, where the 
access permission rights are checked by the access permission logic 202. It is 
important to note at this point that if the relevant physical address portion is within the 
25 micro-TLB, it is assumed that there is no security violation, since the partition checker 
222 effectively polices the information prior to it being stored within the micro-TLB, 
such that if the information is within the micro-TLB, it is assumed to be the 
appropriate non-secure information. Once the access permission has been checked at 
step 336, the process proceeds to step 338, where it is determined whether there is any 
30 violation, in which event an access permission fault abort is issued at step 316. 
Otherwise, the process proceeds to step 3 1 8 where the remainder of the memory 
access is performed, as discussed earlier. 
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In the event that at step 320 no hit was located in the micro-TLB, the process 
proceeds to step 322, where a lookup process is performed in the main TLB 208 to 
determine whether the relevant non-secure descriptor is present. If not, a page table 
5 walk process is performed at step 324 by the translation table walk logic 210 in order 
to retrieve into the main TLB 208 the necessary non-secure descriptor from the non- 
secure page table. The process then proceeds to step 326, or proceeds directly to step 
326 from step 322 in the event that a hit within the main TLB 208 occurred at step 
322. At step 326, it is determined that the main TLB now contains the valid tagged 

10 non-secure descriptor for the virtual address in question, and then at step 328 the 
partition checker 222 checks that the physical address that would be generated from 
the virtual address of the memory access request (given the physical address portion 
within the descriptor) will point to a location in non-secure memory. If not, i.e. if the 
physical address points to a location in secure memory, then at step 330 it is 

15 determined that there is a security violation, and the process proceeds to step 332 
where a secure/non-secure fault abort is issued by the partition checker 222. 

If however the partition checker logic 222 determines that there is no security 
violation, the process proceeds to step 334, where the micro-TLB is loaded with the 
20 sub-section of the relevant descriptor that contains the physical address portion, 
whereafter at step 336 the memory access is then processed in the earlier described 
manner. 

The handling of memory access requests that directly issue a physical address 
25 will now be described with reference to Figure 24. As mentioned earlier, in this 
scenario, the MMU 200 will be deactivated, this preferably being achieved by the 
setting within a relevant register of the CP 15 registers an MMU enable bit, this setting 
process being performed by the monitor mode. Hence, at step 350 the core 10 will 
generate a physical address which will be passed over path 236 into the MPU 220. 
30 Then, at step 352, the MPU checks permissions to verify that the memory access being 
requested can proceed given the current mode of operation, i.e. user, supervisor, etc. 
Furthermore, if the core is operating in non-secure mode, the partition checker 222 will 
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also check at step 352 whether the physical address is within non-secure memory. 
Then, at step 354, it is determined whether there is a violation, i.e. whether the access 
permission processing has revealed a violation, or if in non-secure mode, the partition 
checking process has identified a violation. If either of these violations occurs, then 
the process proceeds to step 356 where an access permission fault abort is generated 
by the MPU 220. It will be appreciated that in certain embodiments there may be no 
distinction between the two types of abort, whereas in alternative embodiments the 
; abort signal could indicate whether it relates to an access permission fault or a security 
i fault. 

If no violation is detected at step 354, the process proceeds to step 358, where 
the memory access to the location identified by the physical address occurs. 

In preferred embodiments only the monitor mode is arranged to generate 
15 physical addresses directly, and accordingly in all other cases the MMU 200 will be 
active and generation of the physical address from the virtual address of the memory 
access request will occur as described earlier. 

Figure 22 illustrates an alternative embodiment of the memory management 
20 logic in a situation where all memory access requests specify a virtual address, and 
accordingly physical addresses are not generated directly in any of the modes of 
operation. In this scenario, it will be appreciated that a separate MPU 220 is not 
required, and instead the partition checker 222 can be incorporated within the MMU 
200. This change aside, the processing proceeds in exactly the same manner as 
25 discussed earlier with reference to Figures 21 and 23. 

It will be appreciated that various other options are also possible. For example, 
assuming memory access requests may be issued by both secure and non-secure modes 
specifying virtual addresses, two MMUs could be provided, one for secure access 
30 requests and one for non-secure access requests, i.e. MPU 220 in Figure 21 could be 
replaced by a complete MMU. In such cases, the use of flags with the main TLB of 
each MMU to define whether descriptors are secure or non-secure would not be 



i 
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needed, as one MMU would store non-secure descriptors in its main TLB, and the 
other MMU would store secure descriptors in its main TLB. Of course, the partition 
checker would still be required to check whether an access to secure memory is being 
attempted whilst the core is in the non-secure domain. 

5 

If, alternatively, all memory access requests directly specified physical 
addresses, an alternative implementation might be to use two MPUs, one for secure 
access requests and one for non-secure access requests. The MPU used for non-secure 
access requests would have its access requests policed by a partition checker to ensure 
10 accesses to secure memory are not allowed in non-secure modes. 

As a further feature which may be provided with either the Figure 21 or the 
Figure 22 arrangement, the partition checker 222 could be arranged to perform some 
partition checking in order to police the activities of the translation table walk logic 

15 210. In particular, if the core is currently operating in the non-secure domain, then the 
partition checker 222 could be arranged to check, whenever the translation table walk 
logic 210 is seeking to access a page table, that it is accessing the non-secure page 
table rather than the secure page table. If a violation is detected, an abort signal would 
preferably be generated. Since the translation table walk logic 210 typically performs 

20 the page table lookup by combining a page table base address with certain bits of the 
virtual address issued by the memory access request, this partition checking may 
involve, for example, checking that the translation table walk logic 210 is using a base 
address of a non-secure page table rather than a base address of a secure page table. 

25 Figure 25 illustrates schematically the process performed by the partition 

checker 222 when the core 10 is operating in a non-secure mode. It will be 
appreciated that in normal operation a descriptor obtained from the non-secure page 
table should describe a page mapped in non-secure memory only. However, in the 
case of software attack, the descriptor may be tampered with in order that it now 

30 describes a section that contains both non-secure and secure regions of memory. 
Hence, considering the example in Figure 25, the corrupted non-secure descriptor may 
cover a page that includes non-secure areas 370, 372, 374 and secure areas 376, 378, 
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380. If the virtual address issued as part of the memory access request would then 
correspond to a physical address in a secure memory region, for example the secure 
memory region 376 as illustrated in Figure 25, then the partition checker 222 is 
arranged to generate an abort to prevent that access taking place. Hence, even though 
the non-secure descriptor has been corrupted in an attempt to gain access to secure 
memory, the partition checker 222 prevents the access taking place. In contrast, if the 
physical address that would be derived using this descriptor corresponds to a non- 
secure memory region, for example region 374 as illustrated in Figure 25, then the 
(access control information loaded into the micro-TLB 206 merely identifies this non- 
secure region 374. Hence, accesses within that non-secure memory region 374 can 
occur but no accesses into any of the secure regions 376, 378 or 380 can occur. Thus, 
it can be seen that even though the main TLB 208 may contain descriptors from the 
non-secure page table that have been tampered with, the micro-TLB will only contain 
physical address portions that will enable access to non-secure memory regions. 

As described earlier, in embodiments where both non-secure modes and secure 
modes may generate memory access requests specifying virtual addresses, then the 
memory preferably comprises both a non-secure page table within non-secure 
memory, and a secure page table within secure memory. When in non-secure mode, 
the non-secure page table will be referenced by the translation table walk logic 210, 
whereas when in secure mode, the secure page table will be referenced by the 
translation table walk logic 210. Figure 26 illustrates these two page tables. As shown 
in Figure 26, the non-secure memory 390, which may for example be within external 
memory 56 of Figure 1, includes within it a non-secure page table 395 specified in a 
CP 15 register 34 by reference to a base address 397. Similarly, within secure memory 
400, which again may be within the external memory 56 of Figure 1, a corresponding 
secure page table 405 is provided which is specified within a duplicate CP 15 register 
34 by a secure page table base address 407. Each descriptor within the non-secure 
page table 395 will point to a corresponding non-secure page in non-secure memory 
390, whereas each descriptor within the secure page table 405 will define a 
corresponding secure page in the secure memory 400. In addition, as will be described 
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in more detail later, it is possible for certain areas of memory to be shared memory 
regions 410, which are accessible by both non-secure modes and secure modes. 

Figure 27 illustrates in more detail the lookup process performed within the 
5 main TLB 208 in accordance with preferred embodiments. As mentioned earlier, the 
main TLB 208 includes a domain flag 425 which identifies whether the corresponding 
descriptor 435 is from the secure page table or the non-secure page table. This ensures 
that when a lookup process is performed, only the descriptors relevant to the particular 
domain in which the core 10 is operating will be checked. Figure 27 illustrates an 
10 example where the core is running in the secure domain, also referred to as the secure 
world. As can be seen from Figure 27, when a main TLB 208 lookup is performed, 
this will result in the descriptors 440 being ignored, and only the descriptors 445 being 
identified as candidates for the lookup process. 

15 I n accordance with preferred embodiments, an additional process ID flag 430, 

also referred to herein as the ASID flag, is provided to identify descriptors from 
process specific page tables. Accordingly, processes PI, P2 and P3 may each have 
corresponding page tables provided within the memory, and further may have different 
page tables for non-secure operation and secure operation. Further, it will be 

20 appreciated that the processes PI, P2, P3 in the secure domain may be entirely separate 
processes to the processes PI, P2, P3 in the non-secure domain. Accordingly, as 
shown in Figure 27, in addition to checking the domain when a main TLB lookup 208 
is required, the ASID flag is also checked. 

25 Accordingly, in the example in Figure 27 where in the secure domain, process 

PI is executing, this lookup process identifies just the two entries 450 within the main 
TLB 208, and a hit or miss is then generated dependent on whether the virtual address 
portion within those two descriptors matches with the corresponding portion of the 
virtual address issued by the memory access request. If it does, then the relevant 

30 access control information is extracted and passed to the micro-TLB 206, the access 
permission logic 202 and the region attribute logic 204. Otherwise, a miss occurs, and 
the translation table walk logic 210 is used to retrieved into the main TLB 208 the 
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required descriptor from the page table provided for secure process PI. As will be 
appreciated by those skilled in the art, there are many techniques for managing the 
content of a TLB, and accordingly when a new descriptor is retrieved for storage in the 
main TLB 208, and the main TLB is already full, any one of a number of known 
5 techniques may be used to determine which descriptor to evict from the main TLB to 
make room for the new descriptor, for example least recently used approaches, etc. 

It will be appreciated that the secure kernel used in secure modes of operation 
-may be developed entirely separately to the non-secure operating system. However, in 

1 0 certain cases the secure kernel and the non-secure operating system development may 
be closely linked, and in such situations it may be appropriate to allow secure 
applications to use the non-secure descriptors. Indeed, this will allow the secure 
applications to have direct access to non-secure data (for sharing) by knowing only the 
virtual address. This of course presumes that the secure virtual mapping and the non- 

15 secure virtual mapping are exclusive for a particular ASID. In such scenarios, the tag 
introduced previously (i.e. the domain flag) to distinguish between secure and non- 
secure descriptors will not be needed. The lookup in the TLB is instead then 
performed with all of descriptors available. 

20 In preferred embodiments, the choice between this configuration of the main 

TLB, and the earlier described configuration with separate secure and non-secure 
descriptors, can be set by a particular bit provided within the CP 15 control registers. 
In preferred embodiments, this bit would only be set by the secure kernel. 

25 In embodiments where the secure application were directly allowed to use a 

non-secure virtual address, it would be possible to make a non-secure stack pointer 
available from the secure domain. This can be done by copying a non-secure register 
value identifying the non-secure stack pointer into a dedicated register within the CP 15 
registers 34. This will then enable the non-secure application to pass parameters via 

30 the stack according to a scheme understood by the secure application. 
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As described earlier, the memory may be partitioned into non-secure and 
secure parts, and this partitioning is controlled by the secure kernel using the CP 15 
registers 34 dedicated to the partition checker 222. The basic partitioning approach is 
based on region access permissions as definable in typical MPU devices. Accordingly, 
the memory is divided into regions, and each region is preferably defined with its base 
address, size, memory attributes and access permissions. Further, when overlapping 
regions are programmed, the attributes of the upper region take highest priority. 
Additionally, in accordance with preferred embodiments of the present invention, a 
new region attribute is provided to define whether that corresponding region is in 
secure memory or in non-secure memory. This new region attribute is used by the 
secure kernel to define the part of the memory that is to be protected as secure 
memory. 



At the boot stage, a first partition is performed as illustrated in Figure 28. This 
initial partition will determine the amount of memory 460 allocated to the non-secure 
world, non-secure operating system and non-secure applications. This amount 
corresponds to the non-secure region defined in the partition. This information will 
then be used by the non-secure operating system for its memory management. The 
rest of the memory 462, 464, which is defined as secure, is unknown by the non-secure 
operating system. In order to protect integrity in the non-secure world, the non-secure 
memory may be programmed with access permission for secure privileged modes 
only. Hence, secure applications will not corrupt the non-secure ones. As can be seen 
from Figure 28, following this boot stage partition, memory 460 is available for use by 
the non-secure operating system, memory 462 is available for use by the secure kernel, 
and memory 464 is available for use by secure applications. 

Once the boot stage partition has been performed, memory mapping of the non- 
secure memory 460 is handled by the non-secure operating system using the MMU 
200, and accordingly a series of non-secure pages can be defined in the usual manner. 
This is illustrated in Figure 29. 
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If a secure application needs to share memory with a non-secure application, 
the secure kernel can change the rights of a part of the memory to transfer artificially 
data from one domain to the other. Hence, as illustrated in Figure 30, the secure kernel 
can, after checking the integrity of a non-secure page, change the rights of that page 
5 such that it becomes a secure page 466 accessible as shared memory. 

When the partition of the memory is changed, the micro-TLB 206 needs to be 
flushed. Hence, in this scenario, when a non-secure access subsequently occurs, a 
miss will occur in the micro-TLB 206, and accordingly a new descriptor will be loaded 
10 from the main TLB 208. This new descriptor will subsequently be checked by the 
partition checker 222 of the MPU as it is attempted to retrieve it into the micro-TLB 
206, and so will be consistent with the new partition of the memory. 

In preferred embodiments, the cache 38 is virtual-indexed and physical-tagged. 

15 Accordingly, when an access is performed in the cache 38, a lookup will have already 
been performed in the micro-TLB 206 first, and accordingly access permissions, 
especially secure and non-secure permissions, will have been checked. Accordingly, 
secure data cannot be stored in the cache 38 by non-secure applications. Access to the 
cache 38 is under the control of the partition checking performed by the partition 

20 checker 222, and accordingly no access to secure data can be performed in non-secure 
mode.. 

However, one problem that could occur would be for an application in the non- 
secure domain to be able to use the cache operations register to invalidate, clean, or 
25 flush the cache. It needs to be ensured that such operations could not affect the 
security of the system. For example, if the non-secure operating system were to 
invalidate the cache 38 without cleaning it, any secure dirty data must be written to the 
external memory before being replaced. Preferably, secure data is tagged in the cache, 
and accordingly can be dealt with differently if desired. 

30 

In preferred embodiments, if an "invalidate line by address" operation is 
executed by a non-secure program, the physical address is checked by the partition 
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checker 222, and if the cache line is a secure cache line, the operation becomes a 
"clean and invalidate" operation, thereby ensuring that the security of the system is 
maintained. Further, in preferred embodiments, all "invalidate line by index" 
operations that are executed by a non-secure program become "clean and invalidate by 
5 index" operations. Similarly, all "invalidate all" operations executed by a non-secure 
program become "clean and invalidate all" operations. 

Furthermore, with reference to Figure 1, any access to the TCM 36 by the 
; DMA32 is controlled by the micro-TLB 206. Hence, when the DMA 32 performs a 

10 lookup in the TLB to translate its virtual address into a physical one, the earlier 
described flags that were added in the main TLB allow the required security checking 
to be performed, just as if the access request had been issued by the core 10. Further, 
as will be discussed later, a replica partition checker is coupled to the external bus 70, 
preferably being located within the arbiter/decoder block 54, such that if the DMA 32 

15 directly accesses the memory coupled to the external bus 70 via the external bus 
interface 42, the replica partition checker connected to that external bus checks the 
validity of the access. Furthermore, in certain preferred embodiments, it would be 
possible to add a bit to the CP15 registers 34 to define whether the DMA controller 32 
can be used in the non-secure domain, this bit only being allowed to be set by the 

20 secure kernel when operating in a privileged mode. 

Considering the TCM 36, if secure data is to placed within the TCM 36, this 
must be handled with care. As an example, a scenario could be imagined where the 
non-secure operating system programs the physical address range for the TCM 

25 memory 36 so that it overlaps an external secure memory part. If the mode of 
operation then changes to a secure mode, the secure kernel may cause data to be stored 
in that overlapping part, and typically the data would be stored in the TCM 36, since 
the TCM 36 will typically have a higher priority than the external memory. If the non- 
secure operating system were then to change the setting of the physical address space 

30 for the TCM 36 so that the previous secure region is now mapped in a non-secure 
physical area of memory, it will be appreciated that the non-secure operating system 
can then access the secure data, since the partition checker will see the area as non- 
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secure and won't assert an abort. Hence, to summarise, if the TCM is configured to 
act as normal local RAM and not as SmartCache, it may be possible for the non-secure 
operating system to read secure world data if it can move the TCM base register to 
non-secure physical address. 

To prevent this kind of scenario, a control bit is in preferred embodiments 
provided within the CP 15 registers 34 which is only accessible in secure privilege 
modes of operation, and provides two possible configurations. In a first 
configuration, this control bit is set to "1", in which event the TCM can only be 
controlled by the secure privilege modes. Hence, any non-secure access attempted to 
the TCM control registers within the CP15 34 will cause an undefined instruction 
exception to be entered. Thus, in this first configuration, both secure modes and non- 
secure modes can use the TCM, but the TCM is controlled only by the secure privilege 
mode. In the second configuration, the control bit is set to "0", in which event the 
TCM can be controlled by the non-secure operating system. In this case, the TCM is 
only used by the non-secure applications. No secure data can be stored to or loaded 
from the TCM. Hence, when a secure access is performed, no look-up is performed 
within the TCM to see if the address matched the TCM address range. 

By default, it is envisaged that the TCM would be used only by non-secure 
operating systems, as in this scenario the non-secure operating system would not need 
to be changed. 

As mentioned earlier, in addition to the provision of the partition checker 222 
within the MPU 220, preferred embodiments of the present invention also provide an 
analogous partition checking block coupled to the external bus 70, this additional 
partition checker being used to police accesses to memory by other master devices, for 
example the digital signal processor (DSP) 50, the DMA controller 52 coupled directly 
to the external bus, the DMA controller 32 connectable to the external bus via the 
external bus interface 42, etc. As mentioned earlier, the entire memory system can 
consist of several memory units, and a variety of these may exist on the external bus 
70, for example the external memory 56, boot ROM 44, or indeed buffers or registers 
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48, 62, 66 within peripheral devices such as the screen driver 46, I/O interface 60, key 
storage unit 64, etc. Furthermore, different parts of the memory system may need to 
be defined as secure memory, for example it may be desired that the key buffer 66 
within the key storage unit 64 should be treated as secure memory. If an access to 
such secure memory were to be attempted by a device coupled to the external bus, then 
it is clear that the earlier described memory management logic 30 provided within the 
chip containing the core 10 would not be able to police such accesses. 

;'- Figure 31 illustrates how the additional partition checker 492 coupled to the 

external bus, also referred to herein as the device bus, is used. The external bus would 
typically be arranged such that whenever memory access requests were issued onto 
that external bus by devices, such as devices 470, 472, those memory access requests 
would also include certain signals on the external bus defining the mode of operation, 
for example privileged, user, etc. In accordance with preferred embodiments of the 
present invention the memory access request also involves issuance of a domain signal 
onto the external bus to identify whether the device is operating in secure mode or 
non-secure mode. This domain signal is preferably issued at the hardware level, and in 
preferred embodiments a device capable of operating in secure or non-secure domains 
will include a predetermined pin for outputting the domain signal onto path 490 within 
the external bus. For the purpose of illustration, this path 490 is shown separately to 
the other signal paths 488 on the external bus. 

This domain signal, also referred to herein as the "S bit" will identify whether 
the device issuing the memory access request is operating in secure domain or non- 
secure domain, and this information will be received by the partition checker 492 
coupled to the external bus. The partition checker 492 will also have access to the 
partitioning information identifying which regions of memory are secure or non- 
secure, and accordingly can be arranged to only allow a device to have access to a 
secure part of memory if the S bit is asserted to identify a secure mode of operation. 



By default, it is envisaged that the S bit would be unasserted, and accordingly a 
pre-existing non-secure device, such as device 472 illustrated in Figure 31, would not 
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output an asserted S bit and accordingly would never be granted access by the partition 
checker 492 to any secure parts of memory, whether that be within registers or buffers 
482, 486 within the screen driver 480, the I/O interface 484, or within the external 
memory 474. 



For the sake of illustration, the arbiter block 476 used to arbitrate between 
memory access requests issued by master devices, such as devices 470, 472, is 
illustrated separately to the decoder 478 used to determine the appropriate memory 
■ device to service the memory access request, and separate from the partition checker 
492. However, it will be appreciated that one or more of these components may be 
integrated within the same unit if desired. 

Figure 32 illustrates an alternative embodiment, in which a partition checker 
492 is not provided, and instead each memory device 474, 480, 484 is arranged to 
police its own memory access dependent on the value of the S bit. Accordingly, if 
device 470 were to assert a memory access request in non-secure mode to a register 
482 within the screen driver 480 that was marked as secure memory, then the screen 
driver 480 would determine that the S bit was not asserted, and would not process the 
memory access request. Accordingly, it is envisaged that with appropriate design of 
the various memory devices, it may be possible to avoid the need for a partition 
checker 492 to be provided separately on the external bus. 

Figure 33 shows different modes and applications running on a processor. The 
dashed lines indicate how different modes and/or applications can be separated and 
isolated from one another during monitoring of the processor according to an 
embodiment of the present invention. 

The ability to monitor a processor to locate possible faults and discover why an 
application is not performing as expected is extremely useful and many processors 
provide such functions. The monitoring can be performed in a variety of ways 
including debug and trace functions. 
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In the processor according to the present technique debug can operate in 
several modes including halt debug mode and monitor debug mode. These modes are 
intrusive and cause the program running at the time to be stopped. In halt debug 
mode, when a breakpoint or watchpoint occurs, the core is stopped and isolated from 
the rest of the system and the core enters debug state. On entry the core is halted, the 
pipeline is flushed and no instructions are pre-fetched. The PC is frozen and any 
interrupts (IRQ and FIQ) are ignored. It is then possible to examine the core internal 
state (via the JTAG serial interface) as well as the state of the memory system. This 
state is invasive to program execution, as it is possible to modify current mode, change 
register contents, etc. Once Debug is terminated, the core exits from the Debug State 
by scanning in the Restart instruction through the Debug TAP (test access port). Then 
the program resumes execution. 

In monitor debug mode, a breakpoint or watchpoint causes the core to enter 
abort mode, taking prefetch or Data Abort vectors respectively. In this case, the core 
is still in a functional mode and is not stopped as it is in Halt debug mode. The abort 
handler communicates with a debugger application to access processor and 
coprocessor state or dump memory. A debug monitor program interfaces between the 
debug hardware and the software debugger. If bit 1 1 of the debug status and control 
register DSCR is set (see later), interrupts (FIQ and IRQ) can be inhibited. In monitor 
debug mode, vector catching is disabled on Data Aborts and Prefetch Aborts to avoid 
the processor being forced into an unrecoverable state as a result of the aborts that are 
generated for the monitor debug mode. It should be noted that monitor debug mode is 
a type of debug mode and is not related to monitor mode of the processor which is the 
mode that supervises switching between secure world and non-secure world. 

Debug can provide a snapshot of the state of a processor at a certain moment. 
It does this by noting the values in the various registers at the moment that a debug 
initiation request is received. These values are recorded on a scan chain (541, 544 of 
Figure 41) and they are then serially output using a JTAG controller (1 8 or Figure 1). 
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An alternative way of monitoring the core is by trace. Trace is not intrusive 
and records subsequent states as the core continues to operate. Trace runs on an 
embedded trace macrocell (ETM) 22, 26 of Figure 1. The ETM has a trace port 
through which the trace information is exported, this is then analysed by an external 
5 trace port analyser. 

The processor of embodiments of the present technique operates in two 
. ■' separate domains, in the embodiments described these domains comprise secure and 
;-non-secure domains. -However, for the purposes of the monitoring functions, it will be 
10 clear to the skilled person that these domains can be any two domains between which 
data should not leak.- Embodiments of the present technique are concerned with 
preventing leakage of data between the two domains and monitoring functions such as 
debug and trace which are conventionally allowed access to the whole system are a 
potential source of data leakage between the domains. 

15 

In the example given above of a secure and non-secure domain or world, 
secure data must not be available to the non-secure world. Furthermore, if debug is 
permitted, in secure world, it may be advantageous for some of the data within secure 
world to be restricted or hidden. The hashed lines in Figure 33 shows some examples 

20 of possible ways to segment data access and provide different levels of granularity. In 
Figure 33, monitor mode is shown by block 500 and is the most secure of all the 
modes and controls switching between secure and non-secure worlds. Below monitor 
mode 500 there is a supervisor mode, this comprises secure supervisor mode 510 and 
non-secure supervisor mode 520. Then there is non-secure user mode having 

25 applications 522 and 524 and secure user mode with applications 512, 514 and 516. 
The monitoring modes (debug and trace) can be controlled to only monitor non-secure 
mode (to the left of hashed line 501). Alternatively the non-secure domain or world 
and the secure user mode may be allowed to be monitored (left of 501 and the portion 
right of 501 that lies below 502). In a further embodiment the non-secure world and 

30 certain applications running in the secure user domain may be allowed, in this case 
further segmentation by hashed lines 503 occurs. Such divisions help prevent leakage 
of secure data between different users who may be running the different applications. 
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In some controlled cases monitoring of the entire system may be allowed. According 
to the granularity required the following parts of the core need to have their access 
controlled during monitoring functions. 

5 There are four registers that can be set on a Debug event; the instruction Fault 

Status Register (IFSR), Data Fault Status Register (DFSR), Fault Address Register 
(FAR), and Instruction Fault Address Register (I FAR). These registers should be 
flushed in some embodiments when going from secure world to non-secure world to 
avoid any leak of data. 

10 

PC sample register: The Debug TAP can access the PC through scan chain 7. 
When debugging in secure world, that value may be masked depending on the debug 
granularity chosen in secure world. It is important that non-secure world, or non- 
secure world plus secure user applications cannot get any value of the PC while the 
1 5 core is running in the secure world. 



TLB entries: Using CP 15 it is possible to read micro TLB entries and read and 
write main TLB entries. We can also control main TLB and micro TLB loading and 
matching. This kind of operation must be strictly controlled, particularly if secure 
20 thread-aware debug requires assistance of the MMU/MPU. 

Performance Monitor Control register: The performance control register gives 
information on the cache misses, micro TLB misses, external memory requests, branch 
instruction executed, etc. Non-secure world should not have access to this data, even 
25 in Debug State. The counters should be operable in secure world even if debug is 
disabled in secure world. 

Debugging in cache system: Debugging must be non-intrusive in a cached 
system. It is important is to keep coherency between cache and external memory. The 
30 Cache can be invalidated using CP1 5, or the cache can be forced to be write-through in 
all regions. In any case, allowing the modification of cache behaviour in debug can be 
a security weakness and should be controlled. 
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Endianness: Non-secure world or secure user applications that can access to 
debug should not be allowed to change endiarmess. Changing the endianness could 
cause the secure kernel to malfunction. Endianness access is prohibited in debug, 
5 according to the granularity. 

Access of the monitoring functions to portions of the core can be controlled at 
initiation of the monitoring function. Debug and trace are initialised in a variety of 
; ways.— Embodiments of the present technique control the access of the monitoring 
0 function to certain secure portions of the core by only allowing initialisation under 
certain conditions. 

Embodiments of the present technique seek to restrict entry into monitoring 
functions with the following granularity: 
5 By controlling seperately intrusive and observable (trace) debug; 

By allowing debug entry in secure user mode only or in the whole secure 

world; 

By allowing debug in secure user mode only and moreover taking account of 
the thread ID (application running). 

0 

In order to control the initiation of a monitoring function it is important to be 
aware of how the functions can be initiated. Figure 34 shows a table illustrating the 
possible ways of initiating a monitoring function, the type of monitoring function that 
is initiated and the way that such an initiation instruction can be programmed. 

5 

Generally, these monitoring instructions can be entered via software or via 
hardware, i.e. via the JTAG controller. In order to control the initiation of monitoring 
functions, control values are used. These comprise enable bits which are condition 
dependent and thus, if a particular condition is present, monitoring is only allowed to 
0 start if the enable bit is set. These bits are stored on a secure register CP 14 (debug and 
status control register, DSCR), which is located in ICE 530 (see Figure 41). 
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In a preferred embodiment there are four bits that enable/disable intrusive and 
observable debug, these comprise a secure debug enable bit, a secure trace enable bit, a 
secure user-mode enable bit and a secure thread aware enable bit. These control 
values serve to provide a degree of controllable granularity for the monitoring function 
5 and as such can help stop leakage of data from a particular domain. Figure 35 
provides a summary of these bits and how they can be accessed. 

These control bits are stored in a register in the secure domain and access to 
this register is limited to three possibilities. Software access is provided via ARM 

10 coprocessor MRC/MCR instructions and these are only allowed from the secure 
supervisor mode. Alternatively, software access can be provided from any other mode 
with the use of an authentication code. A further alternative relates more to hardware 
access and involves the instructions being written via an input port on the JTAG. In 
addition to being used to input control values relating to the availability of monitoring 

1 5 functions, this input port can be used to input control values relating to other functions 
of the processor. 

Further details relating to the scan chain and JTAG are given below. 

20 Register logic cell 

Every integrated circuit (IC) consists of two kind of logic: 

■ Combinatory logic cells; like AND, OR, INV gates. Such gates or combination of 
such gates is used to calculate Boolean expressions according to one or several 

25 input signals. 

■ Register logic cells; like LATCH, FLIP-FLOP. Such cells are used to memorize 
any signal value. Figure 36 shows a positive-edge triggered FLIP-FLOP view: 

When positive-edge event occurs on the clock signal (CK), the output (Q) 
30 received the value of the input (D); otherwise the output (Q) keeps its value in 
memory. 
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Scan chain cell 

For test or debug purpose, it is required to bypass functional access of register 
5 logic cells and to have access directly to the contents of the register logic cells. Thus 
register cells are integrated in a scan chain cell as shown in Figure 37. 

In functional mode, SE (Scan Enable) is clear and the register cell works as a 

single register cell. In test or debug mode, SE is set and input data can come from SI 
1 0 . input (Scan In) instead of D input. 

Scan chain 

All scan chain cells are chained in scan chain as shown in figure 38. 

15 

In functional mode, SE is clear and all register cells can be accessed normally 
and interact with other logic of the circuit. In Test or Debug mode, SE is set and all 
registers are chained between each other in a scan chain. Data can come from the first 
scan chain cell and can be shifted through any other scan chain cell, at the cadence of 
20 each clock cycle. Data can be shifted out also to see the contents of the registers. 



TAP controller 

A debug TAP controller is used to handle several scan chains. The TAP 
controller can select a particular scan chain: it connects "Scan In" and "Scan Out" 
signals to that particular scan-chain. Then data can be scanned into the chain, shifted, 
or scanned out. The TAP controller is controlled externally by a JTAG port interface. 
Figure 39 schematically illustrates a TAP controller 
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JTAG Selective Disable Scan Chain Cell 

5 For security reasons, some registers might not be accessible by scan chain, 

even in debug or test mode. A new input called JADI (JTAG Access Disable) can 
allow removal dynamically or statically of a scan chain cell from a whole scan chain, 
without modifying the scan chain structure in the integrated circuit. Figures 40A and 
B schematically show this input. 

10 

If JADI is inactive (JADI = 0), whether in functional or test or debug mode, the 
scan chain works as usual. If JADI is active (JADI = 1), and if we are in test or debug 
mode, some scan chain cells (chosen by designer), may be "removed" from the scan 
chain structure. In order to keep the same number of scan-chain cell, the JTAG 
15 Selective Disable Scan Chain Cell use a bypass register. Note that Scan Out (SO) and 
scan chain cell output (Q) are now different. 

Figure 41 schematically shows the processor including parts of the JTAG. In 
normal operation instruction memory 550 communicates with the core and can under 
20 certain circumstances also communicate with register CP 14 and reset the control 
values. This is generally only allowable from secure supervisor mode. 

When debug is initiated instructions are input via debug TAP 580 and it is 
these that control the core. The core in debug runs in a step by step mode. Debug 
25 TAP has access to CP 14 via the core (in dependence upon an access control signal 
input on the JSDAEN pin shown as JADI pin, JTAG ACCESS DISABLE INPUT in 
Figure 40) and the control values can also be reset in this way. 

Access to the CP14 register via debug TAP 580 is controlled by an access 
30 control signal JSDAEN. This is arranged so that in order for access and in particular 
write access to be allowed JSDAEN must be set high. During board stage when the 
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whole processor is being verified, JSDAEN is set high and debug is enabled on the 
whole system. Once the system has been checked, the JSDAEN pin can be tied to 
ground, this means that access to the control values that enable debug in secure mode 
is now not available via Debug TAP 580. Generally processors in production mode 
5 have JSDAEN tied to ground. Access to the control values is thus, only available via 
the software route via instruction memory 550. Access via this route is limited to 
secure supervisor mode or to another mode provided an authentication code is given 
(see Figure 42). 

10 It should be noted that by default debug (intrusive and observable - trace) are 

only available in non-secure world. To enable them to be available in secure world the 
control value enable bits need to be set. 

The advantages of this are that debug can always be initiated by users to run in 
15 non-secure world. Thus, although access to secure world is not generally available to 
users in debug this may not be a problem in many cases because access to this world is 
limited and secure world has been fully verified at board stage prior to being made 
available. It is therefore foreseen that in many cases debugging of the secure world 
will not be necessary. A secure supervisor can still initiate debug via the software 
20 route of writing CP 1 4 if necessary. 

Figure 42 schematically shows the control of debug initialisation. In this figure 
a portion of the core 600 comprises a storage element 601 (which may be a CP 15 
register as previously discussed) in which is stored a secure status bit S indicative of 
25. whether the system is in secure world or not. Core 600 also comprises a register 602 
comprising bits indicative of the mode that the processor is running in, for example 
user mode, and a register 603 providing a context identifier that identifies the 
application or thread that is currently running on the core. 

30 When a breakpoint is reached comparator 610, which compares a breakpoint 

stored on register 61 1 with the address of the core stored in register 612, sends a signal 
to control logic 620. Control logic 620 looks at the secure state S, the mode 602 and 
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the thread (context identifier) 603 and compares it with the control values and 
condition indicators stored on register CP 14. If the system is not operating in secure 
world, then a "enter debug" signal will be output at 630. If however, the system is 
operating in secure world, the control logic 620 will look at the mode 602, and if it is 
5 in user mode will check to see if user mode enable and debug enable bits are set. If 
they are then debug will be initialised provided that a thread aware bit has not been 
initialised. The above illustrates the hierarchical nature of the control values. 

; The thread aware portion of the monitoring control is also shown schematically 

10 in Figure 42 along with how the control value stored in register CP 14 can only be 
changed from secure supervisor mode (in this embodiment the processor is in 
production stage and JSDAEN is tied to ground). From a secure user mode, secure 
supervisor mode can be entered using an authentication code and then the control 
value can be set in CP 14. 

15 

Control logic 620 outputs an "enter debug" signal when address comparator 
610 indicates that a breakpoint has been reached provided thread comparator 640 
shows that debug is allowable for that thread. This assumes that the thread aware 
initialisation bit is set in CP 14. If the thread aware initialisation bit is set following a 

20 breakpoint, debug or trace can only be entered if address and context identifiers match 
those indicated in the breakpoint and in the allowable thread indicator. Following 
initiation of a monitoring function, the capture of diagnostic data will only continue 
while the context identifier is detected by comparator 640 as an allowed thread. When 
a context identifier shows that the application running is not an allowed one, then the 

25 capture of diagnostic data is suppressed. 

It should be noted that in the preferred embodiment, there is some hierarchy 
within the granularity. In effect the secure debug or trace enable bit is at the top, 
followed by the secure user-mode enable bit and lastly comes the secure thread aware 
30 enable bit. This is illustrated in Figures 43A and 43B (see below). 
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The control values held in the "Debug and Status Control" register (CP 14) 
control secure debug granularity according to the domain, the mode and the executing 
thread. It is on top of secure supervisor mode. Once the "Debug and Status Control" 
register CP 14 is configured, it's up to secure supervisor mode to program the 
corresponding breakpoints, watchpoints, etc to make the core enter Debug State. 

Figure 43A shows a summary of the secure debug granularity for intrusive 
debug. Default values at reset are represented in grey colour. 

10 ]t is the same for debug granularity concerning observable debug . Figure 43B 

shows a summary of secure debug granularity in this case, here default values at reset 
are also represented in grey colour. 

Note that Secure user-mode debug enable bit and Secure thread-aware debug 
1 5 enable bit are commonly used for intrusive and observable debug. 

A thread aware initialisation bit is stored in register CP 14 and indicates if 
granularity by application is required. If the thread aware bit has been initialised, the 
control logic will further check that the application identifier or thread 603 is one 
20 indicated in the thread aware control value, if it is, then debug will be initialised. If 
either of the user mode or debug enable bits are not set or the thread aware bit is set 
and the application running is not one indicated in the thread aware control value, then 
the breakpoint will be ignored and the core will continue doing what it was doing and 
debug will not be initialised. 

25 

In addition to controlling initialisation of monitoring functions, the capture of 
diagnostic data during a monitor function can also be controlled in a similar way. In 
order to do this the core must continue to consider both the control values, i.e. the 
enable bits stored in register CP 14 and the conditions to which they relate during 
30 operation of the monitoring function. 
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Figure 44 shows schematically granularity of a monitoring function while it is 
running. In this case region A relates to a region in which it is permissible to capture 
diagnostic data and region B relates to region in which control values stored in CP 14 
indicate that it is not possible to capture diagnostic data. 

5 

Thus, when debug is running and a program is operating in region A, 
diagnostic data is output in a step-by-step fashion during debug. When operation 
switches to Region B, where the capture of diagnostic data is not allowed, debug no 
' longer proceeds in a step by step fashion, rather it proceeds atomically and no data is 
10 captured. This continues until operation of the program re-enters region A whereupon 
the capture of diagnostic data starts again and debug continues running in a step-by- 
step fashion. 

In the above embodiment, if secure domain is not enabled, a SMI instruction is 
15 always seen as an atomic event and the capture of diagnostic data is suppressed. 

Furthermore, if the thread aware initialisation bit is set then granularity of the 
monitoring function during operation with respect to application also occurs. 

20 

With regard to observable debug or trace, this is done by ETM and is entirely 
independent of debug. When trace is enabled ETM works as usual and when it is 
disabled, ETM hides trace in the secure world, or part of the secure world depending 
on the granularity chosen. One way to avoid ETM capturing and tracing diagnostic 
25 data in the secure domain when this is not enabled is to stall ETM when the S-bit is 
high. This can be done by combining the S-bit with the ETMPWRDOWN signal, so 
that the ETM values are held at their last values when the core enters secure world. 
The ETM should thus trace a SMI instruction and then be stalled until the core returns 
to non-secure world. Thus, the ETM would only see non-secure activity. 

30 

A summary of some of the different monitoring functions and their granularity 
is given below. 
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Intrusive debug at board stage 

At board stage when the JSDAEN pin is not tied, there is the ability to enable 
debug everywhere before starting any boot session. Similarly, if we are in secure 
5 supervisor mode we have similar rights. 

If we initialise debug in halt debug mode all registers are accessible (non- 
* ; secure and secure register banks) and the whole memory can be dumped, except the 
i bits dedicated to control debug. 

10 

Debug halt mode can be entered from whatever mode and from whatever 
domain. Breakpoints and watchpoints can be set in secure or in non-secure memory. 
In debug state, it is possible to enter secure world by simply changing the S bit via an 
. MCR instruction. 

15 

As debug mode can be entered when secure exceptions occur, the vector trap 
register is extended with new bits which are; 
SMI vector trapping enable 
Secure data abort vector trapping enable 
20 Secure prefetch abort vector trapping enable 

Secure undefined vector trapping enable. 

In monitor debug mode, if we allow debug everywhere, even when an SMI is 
called in non-secure world, it is possible to enter secure world in step-by-step debug. 
25 When a breakpoint occurs in secure domain, the secure abort handler is operable to 
dump secure register bank and secure memory. 

The two abort handlers in secure and in non-secure world give their 
information to the debugger application so that debugger window (on the associated 
30 debug controlling PC) can show the register state in both secure and non-secure 
worlds. 
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Figure 45A shows what happens when the core is configured in monitor debug 
mode and debug is enabled in secure world. Figure 45B shows what happens when 
the core is configured in monitor debug mode and the debug is disabled in secure 
world. This later process will be described below. 

5 

Intrusive debug at production stage 

In production stage when JSDAEN is tied and debug is restricted to non-secure 
world, unless the secure supervisor determines otherwise, then the table shown in 
Figure 45B shows what happens. In this case SMI should always be considered as an 
10 atomic instruction, so that secure functions are always finished before entering debug 
state. 

Entering debug halt mode is subject to the following restrictions: 

15 External debug request or internal debug request is taken into account in non- 

secure world only. If EDBGRQ (external debug request) is asserted while in secure 
world, the core enters debug halt mode once secure function is terminated and the core 
is returned in non-secure world. 

20 Programming a breakpoint or watchpoint on secure memory has no effect and 

the core is not stopped when the programmed address matches. 

Vector Trap Register (details of this are given below) concerns non-secure 
exceptions only. All extended trapping enable bits explained before have no effect. 

25 

Once in halt debug mode the following restrictions apply: 
S bit cannot be changed to force secure world entry, unless secure debug is 
enabled. 

Mode bits can not be changed if debug is permitted in secure supervisor mode 

30 only. 

Dedicated bits that control secure debug cannot be changed. 
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If a SMI is loaded and executed (with system speed access), the core re-enters 
debug state only when secure function is completely executed. 

In monitor debug mode because monitoring cannot occur in secure world, the 
5 secure abort handler does not need to support a debug monitor programme. In non 
secure world, step-by-step is possible but whenever an SMI is executed secure 
function is executed entirely in other words an XWSI only "step-over" is allowed 
while "step-in" and "step-over" are possible on all other instructions. XWSI is thus 
considered an atomic instruction. 

10 

Once secure debug is disabled, we have the following restrictions: 
Before entering monitor mode: 

Breakpoints and watchpoints are only taken into account in non-secure world. 
15 If bit S is set, breakpoints/watchpoints are bypassed. Note that watchpoints units are 
also accessible with MCR/MRC (CP 14) which is not a security issue as 
breakpoint/watchpoint has no effect in secure memory. 

BKPT are normally used to replace the instruction on which breakpoint is set. 
20 This supposes to overwrite this instruction in memory by BKPT instruction, which will 
be possible only in non-secure mode. 

Vector Trap Register concerns non-secure exceptions only. All extended 
trapping, enable bits explained before have no effect. Data abort and Pre-fetch abort 
25 enable bits should be disabled to avoid the processor being forced in to an 
unrecoverable state. 

Via JTAG, we have the same restrictions as for halt mode (S bit cannot be 
modified, etc) 

30 Once in monitor mode (non-secure abort mode) 

The non-secure abort handler can dump non-secure world and has no visibility 
on secure banked registers as well as secure memory. 
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Executes secure functions with atomic SMI instruction 
S bit cannot be changed to force secure world entry. 

Mode bits can not be changed if debug is permitted in secure supervisor mode 

only. 

5 

Note that if an external debug request (EDBGRQ) occurs, 

In non-secure world, the core terminates the current instruction and enters then 
immediately debug state (in halt mode). 

In secure world, the core terminates the current function and enters the Debug 
1 0 State when it has returned in non-secure world. 

The new debug requirements imply some modifications in core hardware. The 
S bit must be carefully controlled, and the secure bit must not be inserted in a scan 
chain for security reason. 

15 

In summary, in debug, mode bits can be altered only if debug is enabled in 
secure supervisor mode. It will prevent anybody that has access to debug in the secure 
domain to have access to all secure world by altering the system (modifying TBL 
entries, etc). In that way each thread can debug its own code, and only its own code. 
20 The secure kernel must be kept safe. Thus when entering debug while the core is 
running in non-secure world, mode bits can only be altered as before. 

Embodiments of the technique use a new vector trap register. If one of the 
bits in this register is set high and the corresponding vector triggers, the processor 
25 enters debug state as if a breakpoint has been set on an instruction fetch from the 
relevant exception vector. The behaviour of these bits may be different according to 
the value of 'Debug in Secure world Enable' bit in debug control register. 

The new vector trap register comprises the following bits: D_s_abort, 
30 P_s_abort, S_undef, SMI, FIQ, IRQ, Unaligned, D_abort, P_abort, SWI and Undef. 
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• D_s_abort bit: should only be set when debug is enabled in secure world and when 
debug is configured in halt debug mode. In monitor debug mode, this bit should 
never bit set. If debug in secure world is disabled, this bit has no effect whatever its 
value. 

5 • P_s_abort bit: same as D_s_abort bit. 

• S_undef bit: should only be set when debug is enable in secure world. If debug in 
secure world is disabled, this bit has no effect whatever its value is. 

• SMI bit: should only be set when debug is enabled in secure- world. If debug in 
secure world is disabled, this bit has no effect whatever its value is. 

10 • FIQ, IRQ, D_abort, P_abort, SWI, undef bits: correspond to non-secure 
exceptions, so they are valid even if debug in secure world is disabled. Note that 
D_abort and P_abort should not be asserted high in monitor mode. 

• Reset bit: as we enter secure world when reset occurs, this bit is valid only when 
debug in secure world is enabled, otherwise it has no effect. 

15 

Although a particular embodiment of the invention has been described herein, 
it will be apparent that the invention is not limited thereto, and that many 
modifications and additions may be made within the scope of the invention. For 
example, various combinations of the features of the following dependent could be 
20 made with the features of the independent claims without departing from the scope of 
the present invention. 
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CLAIMS 



1. A method of controlling a monitoring function of a processor, the 
processor being operable in at least two domains, comprising a first domain and a 

5 second domain, the first and second domains each comprising at least one mode, the 
method comprising the steps of: 

setting at least one control value, the at least one control value relating to a 
/ condition and being indicative of whether the monitoring function is allowable in the 
first domain; and 

1 0 °nly allowing initiation of the monitoring function in the first domain when the 

condition is present if its related control value indicates that the monitoring function is 
allowable. 

2. A method according to claim 1, wherein the first domain is a secure 
1 5 domain and the second domain is a non-secure domain. 

3. A method according to claim 1 or 2, wherein the condition comprises a 
domain, mode or type of monitoring function. 

20 4. A method according to claim 3 when dependent on claim 2, wherein the 

condition comprises a secure domain and the control value comprises a secure domain 
enable value, initiation of monitoring in the secure domain only being allowed if the 
secure domain enable value is set. 

25 5. A method according to claim 4 or claim 3 when dependent on claim 2 

wherein the secure domain includes a secure user mode and the condition comprises a 
secure user mode. 
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6. A method according to claim 5 wherein the control value comprises a 
secure user mode enable bit and initiation of monitoring from secure user mode is only 
allowed if the secure user mode enable bit has been set. 



79 



7. A method according to claim 3, wherein the condition comprises a type 
of monitoring function. 

5 8. A method according to claim 7, wherein the condition comprises a 

debug monitoring function and the control value comprises a debug enable bit, 
initiation of debug in the first domain only being allowable if the debug enable bit has 
been set. 

10 9. A method according to claim 7 or 8, wherein the condition comprises a 

trace monitoring function and the control value comprises a trace enable bit, initiation 
of trace in the first domain only being allowable if the control trace enable bit has been 
set. 

15 10. A method according to any one of claim 9 when dependent on claim 8 

and 4, wherein the secure domain enable value comprises a secure debug enable bit 
and a secure trace enable bit, initiation of debug and trace in the secure domain only 
being allowable if respective portions of the secure domain enable value are set. 

20 1 1 A method according to any one of the preceding claims, the method 

comprising setting a plurality of control values, each of the plurality of control values 
relating to a different condition; and 

only allowing initiation of the monitoring function in the first domain if any of 
the conditions are present if each of the control values related to a condition that is 

25 present indicate that the monitoring function is allowable. 

12. A method according to any one of the preceding claims, the method 
further comprising the steps of: 

setting a control indicator, the control indicator indicating that monitoring is 
30 only allowable for specified applications; and 

prior to initialising the monitoring function checking an application identifier; 

and 
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only allowing initiation of the monitoring function if the application currently 
running is one for which monitoring is allowable. 
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13. A method according to claim 12, wherein the step of setting a control 
indicator comprises setting a control indicator stored in a predetermined position in a 
storage element. 

14. A method according to claim 12 or 13, wherein the monitoring function 
comprises monitoring the processor and capturing diagnostic data, the method 
comprising the further step of: 

following initiation of the monitoring function only allowing the capturing of 
diagnostic data in the first domain while an application running on the processor is one 
for which monitoring is allowable. 



15. A method according to any one of claims 1 to 11, wherein the 
monitoring function comprises monitoring the processor and capturing diagnostic data, 
the method comprising the further step of: 

following initiation of the monitoring function only allowing the capturing of 
20 diagnostic data in the first domain when a condition changes if a control value related 
to the changed condition indicates that the monitoring function is allowable. 



16. A method according to any one of the preceding claims, wherein the 
setting of at least one control value is performed either by setting the control value via 

25 an input port or by setting the control value from the first domain. 

17. A method according to claim 16, the method comprising the further 
step of blocking write access to the control value via the input port such that the step of 
setting the control value can henceforth only be performed by setting the control value 

30 from the first domain. 
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18. A method according to any one of the preceding claims, wherein the 
first domain comprises a first user mode and a first privileged mode and the step of 
setting at least one control value in the first domain, comprises setting the control 
value from the first privileged mode. 

19. A method according to any one of claims 16 or 17, wherein the first 
domain comprises a first user mode and a first privileged mode and the step of setting 

'.■at least one control value in the first domain, comprises inputting an authentication 
i code from a mode that is not a first privileged mode and then setting the control value. 



20. A processor operable in a first domain and a second domain the first 
and second domains each comprising at least one mode, the processor comprising: 

monitoring logic; 

a storage element operable to be set to contain at least one control value, the at 
15 least one control value relating to a condition and being indicative of whether 
operation of the monitoring logic is allowable in the first domain; and 

control logic operable to control initiation of the monitoring logic and only to 
allow initiation of the monitoring logic in the first domain when the condition is 
present if its related control value indicates that operation of the monitoring logic is 
20 allowable. 

21. A processor according to claim 20, wherein the first domain is a secure 
domain and the second domain is a non-secure domain. 

25 22 - A processor according to claim 20 or 21, wherein the condition 

comprises a domain, mode or type of monitoring logic. 

23. A processor according to claim 22 when dependent on claim 21, 
wherein the condition comprises a secure domain and the control value comprises a 
30 secure domain enable bit, initiation of monitoring in the secure domain only being 
allowed if the storage element contains a secure domain enable bit. 
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24. A processor according to claim 23, or claim 22 when dependent on 
claim 21 wherein the secure domain includes a secure user mode and the condition 
comprises a secure user mode. 

5 25. A processor according to claim 24 wherein the control value comprises 

a secure user mode enable bit and the control logic is operable to allow initiation of the 
monitoring logic from secure user mode only when the storage element contains a 
secure user mode enable bit. 

10 26. A processor according to claim 21, wherein the condition comprises a 

type of monitoring function. 

27. A processor according to claim 26, wherein the condition comprises 
debug monitoring and the control value comprises a debug enable bit, the control logic 

15 being operable to allow initiation of the monitoring logic in the first domain only when 
the storage element contains a debug enable bit. 

28. A processor according to claim 26, wherein the condition comprises 
trace monitoring and the control value comprises a trace enable bit, the control logic 

20 being operable to allow initiation of the trace logic in the first domain only when the 
storage element contains a control trace enable bit. 

29. A processor according to any one of the preceding claims, wherein: 

the storage element is operable to contain a plurality of control values, each of the 
25 plurality of control values relating to a different condition; and 

the control logic is operable to only allow initiation of the monitoring logic in 
the first domain if any of the conditions are present if each of the control values related 
to a condition that is present indicate that the monitoring logic is allowable. 

30 30 - A processor according to claim 29 wherein one condition comprises a 

secure domain and the corresponding control value comprises a secure domain enable 
bit and a further condition comprises a secure user mode and the corresponding control 
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value comprises a secure user mode enable bit, the control logic being operable to 
initiate the monitoring logic from secure user mode only when the storage element 
contains both a secure user mode enable bit and a secure domain enable bit. 

5 3 1 . A processor according to any one of the preceding claims, wherein: 

the storage element is further operable to contain a control indicator, the control 
indicator indicating that monitoring is only allowable for identified applications; and 

the control logic is operable to check at least one identifier identifying an 
'application that is allowable, the control logic only initiating the monitoring logic in 
10 the first domain when the application currently running is one identified as being one 
for which monitoring is allowable. 

32. A processor according to claim 31, the processor comprising a further 
storage element, the storage element being operable to contain the at least one 

15 identifier specifying an application that is allowable. 

33. A processor according to claim 31 or 32, wherein the monitoring logic 
is operable to monitor the processor and capture diagnostic data; and 

wherein the control logic is operable to control the monitoring logic to suppress 
20 capturing of diagnostic data in the first domain when the control logic detects that the 
application running is not one identified as being allowable. 

34. A processor according to any one claims of 20 to 33, the processor 
further comprising an input port, wherein the control value is operable to be set in the 

25 storage element either via the input port or via an input from the first domain. 

35. A processor according to claim 34, the processor comprising a means of 
blocking write access to the control value via the input port such that the setting of the 
control value can henceforth only be performed by setting the control value via an 

30 input from the first domain. 



84 



36. A processor according to any one of claims 20 to 35, wherein the first 
domain comprises a first user mode and a first privileged mode and the control value is 
operable to be set in the storage element via an input from the first privileged mode. 

37. A processor according to any one of claims 35 or 36, wherein the first 
domain comprises a first user mode and a first privileged mode and the control value is 
operable to be set in the storage element by the input of an authentication code from a 
mode that is not a first privileged mode followed by an input of the control value. 

38. A processor according to any one of claims 20 to 37, wherein the 
storage element comprises a register. 



39. A processor according to any one of claims 30 to 38, wherein the 
further storage element comprises a register. 

15 

40. A method of controlling a monitoring function of a processor, 
substantially as hereinbefore described with reference to the accompanying drawings. 

41. A processor, substantially as hereinbefore described with reference to 
20 the accompanying drawings. 
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ABSTRACT 
Monitorin g Control for Multi-Domain Processors 

There is provided a method of controlling a monitoring function of a processor, 
the processor being operable in at least two domains, comprising a first domain and a 
second domain, the first and second domains each comprising at least one mode, the 
method comprising the steps of: setting at least one control value, the at least one 
control value relating to a condition and being indicative of whether the monitoring 
function is allowable in the first domain; and only allowing initiation of the monitoring 
function in the first domain when the condition is present if its related control value 
indicates that the monitoring function is allowable. In some embodiments the first 
domain is a secure domain and the monitoring function is a debug or trace function. 
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Figure 41 
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software (CP 14) 


Program breakpoint register and/or 
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succeed with Instruction Address and/or 
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Halt/monitor 

( 1 > 


Software breakpoint 
instruction 


Put a BKPT instruction into scan 
chain 4 (Instruction Transfer 
Register) through Debug TAP or 

Use BKPT instruction directly in 
the code. 


BKPT instruction must reach execution 
stage. 


Halt/monitor 


Vector trap breakpoint 


Debug TAP 


Program vector trap register and 
address matches. 


Halt/monitor 
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Program watchpoint.register and/or 
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Halt instruction has been scanned in. 
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C): In monitor mode, breakpoints and watchpoints cannot be data-dependent. 

( 2 ): The cores have support for thread-aware breakpoints and watchpointsiinc^^^ enable secure 

debug on some particular threads. '}' ri 
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R/W by programming the ICE by the JTAG (scanl) 
■ R/W by using MRC/MCR instruction (CP 1 4) 
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1 : debug in secure 
world and non- 
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In functional mode or debug monitor mode: R/W by 
using MRC/MCR instruction (CP 14) (only in secure 
supervisor mode) 

In Debug halt mode: No access - MCR/MRC 
instructions have any effect 
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possible in secure 
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instructions have any effect 

(R/W by programming the ICE by the JTAG (scanl) 
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no 
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CP 14 bits in Debug and Status Control register 


meaning 


Secure debug enable 
bit 


Secure user-mode 
debug enable bit 


Secure thread-aware 
debug enable bit 


0 
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No intrusive debug in entire secure world is possible. 
Any debug request, breakpoints, watchpoints, and other 
mechanism to enter debug state are ignored in entire 
secure world. 
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Debug in entire secure world is possible 
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: 1 




Debug in secure user-mode only. Any debug request, 
breakpoints, watchpoints, and other mechanism to enter 
debug state are taken into account in user mode only. 
(Breakpoints and watchpoints linked or not to a thread 
ED are taken into account). Access in debug is restricted 
to what secure user can have access to. 


1 




1 


Debug is possible only in some particular threads. In 
that case only thread-aware breakpoints and 
watchpoints linked to a thread ID are taken into account 
to enter debug state. Each thread can moreover debug 
its own code, and only its own code. 
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debug enable bit 


meaning 
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No observable debug in entire secure world is possible. 
Trace module (ETM) must not trace internal core 
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Trace is possible when the core is in secure user-mode 
only. 
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Trace is possible only when the core is executing some 
particular threads in secure user mode. Particular 
hardware must be dedicated for this, or re-use 
breakpoint register pair: Context ID match must enable 
trace instead of entering debug state. 
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Breakpoint hits 


Non-secure prefetch abort handler 


secure prefetch abort handler 


Software breakpoint instruction 


Non-secure prefetch abort handler 


secure prefetch abort handler 


Vector trap breakpoint 


Disabled for non-secure data abort 
and non-secure prefetch abort 
interruptions. For other non-secure 
exceptions, prefetch abort. 


Disabled for secure data abort and 
secure prefetch abort exceptions (*). 
For other exceptions, secure prefetch 
abort. 


Watchpoint hits 


Non-secure data abort handler 


secure data abort handler 


Internal debug request 


Debug state in halt mode 


debug state in halt mode 


External debug request 


Debug state in halt mode 


debug state in halt mode 



& SeeHr^cm^ ; vector trap register, ; 

(2) Note that when external or internal debug request is asserted, the core enters halt mode and not monitor mode. 
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( l ) As substitution of BKPT instruction in secure world from non-secure world is not possible, non-secure abort 
must handle the violation. 
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